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Customer No.00270 

1 UPN-P3067-P3 
METHODS OF GENERATING CHIMERIC ADENOVIRUSES 
AND USES FOR SUCH CHIMERIC ADENOVIRUSES 

BACKGROUND OF THE INVENTION 

The presence of humoral immunity (circulating antibodies) to adenovirus 
caps,d proteins is , batrier a tlx UJe „ f ^ for ^ ^ ^ 

prorotype adenovirus vectors that have been developed for gene rherupy are based on 

subgroup C adenoviruses such as that of serotype 5. The prevalence of neutralizing 

annbodies against subgreup C adenoviruses is g.„era,ly high in human popu , alions „ 

a result of frequent exposure ,„ these pathogen, This fac, is like,, to greatly limit the 

effecveness of gene rherapy vectors based on serotypes such as AdS 

Analysis of the nature of the protective antibodies against adenoviruses has 
indicated ft* me most jmponant ^ „ ^ ^ ^ ^ ^ 

0988)7. V Mtl , 2321; Gall a,*,. (,096)7. M 70, 2, ,6). Severe, effort, have 
been made to engineer the hexon so as to evade the anti-hexon antibodies by making 
chrmenc adenoviruses harboring hexons f,„ m other serotypes [Roy a, „/. ( 1998) J 
Virol. 72, 6875; US Paten, No 59223,5; Gal, „al. (1998)7 W. 72, ,0260; Youi, 
« of. (2002)^. W ra e,.,3,31,;Wu« a /. (2002)7 vlroL 76 

However, mis has been hrgely unsuacessnr, whan exchanges among distant serotype, 
are attempted. v 

Alternatively, investigators have proposed using adenovirus vectors that 
rare.y cause human infections or using adenoviruses from non-human sources 
However, the lack of a practical manner in which to produce large numbers of such 
vectors has proved to be a hindrance to developing such vectors. 



SUMMARY OF THE INVENTION 

The present invention provides a method of modifying adenoviruses having capsids 
30 and particularly, inc.uding hexons, from serotypes which are not wel. adapted for ' 
growth in cells useful for adenoviral virion production. The method is useful for 
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production of scalable amounts of adenoviruses. The modified, or chimeric, 
adenoviruses are useful for a variety of purposes which are described herein. 
The invention further provides novel, isolated, adenovirus SA1 8 nucleic acid and 
amino acid sequences, vectors containing same, cell lines containing such SA18 
5 sequences and/or vectors, and uses thereof. 

Other aspects and advantages of the present invention will be readily apparent from 
the following detailed description of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

0 Fig. 1 provides the map of the genome of the simian adenovirus generated 

by shotgun cloning as described in the examples below. 

Fig. 2 provides the map of the recombinant Adhu5-SV25 chimeric virus 
termed H5S25H5. 

5 DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides chimeric adenoviruses composed of the left 
terminal end and right terminal end of an adenovirus which can be cultured in the 
selected host cell, and the internal regions encoding, at a minimum, the capsid 
proteins of another adenovirus serotype. This invention is particularly advantageous 
for generating adenoviruses having serotypes which are difficult to culture in a 
desired cell type. The invention thus permits generation of chimeric adenoviruses of 
varying serotypes. 

In the embodiments illustrated herein, chimeric adenoviruses have been 
constructed where most structural proteins, and not merely the hexon or fiber are 
denved from an adenovirus of an unrelated serotype, thereby preserving the majority 
of the protein-protein interactions that are involved in capsid assembly. Most of the 
early genes such as those encoded by the adenovirus El and E4 regions that are 
responsible for transcription regulation and regulation of the host cell cycle are 
retained from a different serotype that is known to result in high titer vims generation 
m the commonly used cell types, such as HEK 293 which supplies the Ad5 El 
proteins in trans. 

In another embodiment, the invention provides novel nucleic acid and amino 
acid sequences from Ad SA18, which was originally isolated from vervet monkey 



[ATCC VR-943]. The present invention further provides novel adenovirus vectors 
and packaging cell lines to produce those vectors for use in the /„ vitro production of 
recombinant proteins or fragments or other reagents. The invention further provides 
composes for use in delivering a heterologous molecule for therapeutic or vaccine 
purposes. Such therapeutic or vaccine compositions contain the adenoviral vectors 
carrymg an inserted heterologous molecule. In addition, novel sequences of the 
invention are useful in providing the essential helper functions required for 
production of recombinant adeno-associated viral (AAV) vectors. Thus, the invention 
prov.des helper constructs, methods and cel. lines which use these sequences in such 
production methods. 

The term "substantial homology" or "substantial similarity," when referring to 
a nucle.c acid or fragment thereof, indicates that, when optimally aligned with 
appropriate nucleotide insertions or deletions with another nucleic acid (or its 
complementary strand), there is nucleotide sequence identity in at least about 95 to 
99% of the aligned sequences. 

The term "substantial homology" or "substantial similarity," when referring to 
ammo ac.ds or fragments thereof, indicates that, when optimally aligned with 
appropriate amino acid insertions or deletions with another amino acid (or its 
complementary strand), there is amino acid sequence identity in at least about 95 to 
99o/o of the aligned sequences. Preferably, the homo.ogy is over full-length sequence 
or a protein thereof, or a fragment thereof which is at least 8 amino acids, or more 
des,rably, at least 1 5 amino acids in length. Examples of suitable fragments are 
described herein. 

The term "percent sequence identity" or "identical" in the context of nucleic 
acd sequences refers to the residues in the two sequences that are the same when 
ahgned for maximum correspondence. The length of sequence identity comparison 
may be over the fu.l-.ength of the genome (e.g., about 36 kbp), the tul.-length of an 
open reading frame of a gene, protein, subunit, or enzyme [see, e.g., the tables 
proving the adenoviral coding regions], or a fragment of at least about 500 to 5000 
nucleotides, is desired. However, identity among smaller fragments, e.g., of at least 
about mne nucleotides, usually at least about 20 to 24 nucleotides, at least about 28 to 
32 nudeotides, at least about 36 or more nucleotides, may also be desired. Similarly 
"percent sequence identity" may be readily determined for amino acid sequences 
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over the full-length of a protein, or a fragment thereof. Suitably, a fragment is at least 
about 8 amino acids in length, and may be up to about 700 amino acids. Examples of 
suitable fragments are described herein. 

Identity is readily determined using such algorithms and computer programs 
as are defined herein at default settings. Preferably, such identity is over the full 
length of the protein, enzyme, subunit, or over a fragment of at least about 8 amino 
acids in length. However, identity may be based upon shorter regions, where suited to 
the use to which the identical gene product is being put. 

As described herein, alignments are performed using any of a variety of 
publicly or commercially available Multiple Sequence Alignment Programs, such as 
"Clustal W», accessible through Web Servers on the internet. Alternatively, Vector 
NT1 utilities are also used. There are also a number of algorithms known in the art 
that can be used to measure nucleotide sequence identity, including those contained in 
the programs described above. As another example, polynucleotide sequences can be 
compared using Fasta, a program in GCG Version 6. 1 . Fasta provides alignments and 
percent sequence identity of the regions of the best overlap between the query and 
search sequences. For instance, percent sequence identity between nucleic acid 
sequences can be determined using Fasta with its default parameters (a word size of 6 
and the NOPAM factor for the scoring matrix) as provided in GCG Version 6.1, 
herein incorporated by reference. Similarly programs are available for performing 
amino acid alignments. Generally, these programs are used at default settings, 
although one of skill in the art can alter these settings as needed. Alternatively, one of 
skill in the art can utilize another algorithm or computer program that provides at least 
the level of identity or alignment as that provided by the referenced algorithms and 
25 programs. 

As used throughout this specification and the claims, the term "comprise" and 
its variants including, "comprises", "comprising", among other variants, is inclusive 
of other components, elements, integers, steps and the like. The term "consists of or 
"consisting of are exclusive of other components, elements, integers, steps and the 
30 like. 

Except where otherwise specified, the term 'Vector" includes any genetic 
element known in the art which will deliver a target molecule to a cell, including, 
naked DNA, a plasmid, phage, transposon, cosmids, episomes, viruses, etc. 
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By "minigene" is meant the combination of a selected heterologous gene and 
the other regulatory elements necessary to drive translation, transcription and/or 
expression of the gene product in a host cell. 

As used herein, the term "transcomplement" refers to when a gene (gene 
product) of one adenovirus serotype supplies an adenovirus serotype lacking this gene 
(gene product) from another serotype with the missing function. For example, human 
adenovirus serotype 5 Ela and Elb functions are known to transcomplement El- 
deleted chimpanzee adenovirus Pan 9. Similarly, the inventors have found that 
human Ad5 El transcomplements El-deleted chimpanzee adenovirus serotypes Pan5, 
Pan6, Pan7, and simian adenovirus serotypes SVl, SV25 and SV39. Other examples 
of transcomplementing serotypes include human AdS and human Ad2, Ad3, Ad4, 
Ad5, Ad7, and Adl2. 

The term "functionally deleted" or "functional deletion" means that a 
sufficient amount of the gene region is removed or otherwise damaged, e.g., by 
mutation or modification, so that the gene region is no longer capable of producing 
functional products of gene expression. If desired, the entire gene region may be 
removed. Other suitable sites for gene disruption or deletion are discussed elsewhere 
in the application. 

The term "functional" refers to a product (e.g., a protein or peptide) which 
performs its native function, although not necessarily at the same level as the native 
product. The term "functional" may also refer to a gene which encodes and from 
which a desired product can be expressed. 

I. Chimeric Adenoviral Vectors 

The compositions of this invention include chimeric adenoviral vectors that 
deliver a heterologous molecule to cells. For delivery of such a heterologous 
molecule, the vector can be a plasmid or, preferably, a chimeric adenovirus. The 
chimeric adenoviruses of the invention include adenovirus DNA from at least two 
source serotypes, a "donating serotype" and a "parental adenovirus" as described in 
more detail herein, and a minigene. » 

Because the adenoviral genome contains open reading frames on both strands 
m many instances reference is made herein to 5' and 3' ends of the various regions to 
avoid confusion between specific open reading frames and gene regions. Thus, when 



10 



15 



20 



25 



30 



reference is made herein to the "left" and "right" end of the adenoviral genome, this 
reference is to the ends of the approximately 36 kb adenoviral genome when depicted 
in schematic form as is conventional in the art [see, e.g., Horwitz, "Adenoviridae and 
Their Replication", in Virology, 2d ed., pp. 1679-1721 (1990)]. Thus, as used 
herein, the "left terminal end" of the adenoviral genome refers to portion of the 
adenoviral genome which, when the genome is depicted schematically in linear form, 
is located at the extreme left end of the schematic. Typically, the left end refers to be' 
portion of the genome beginning at map unit 0 and extending to the right to include at 
least the 5' inverted terminal repeats (ITRs), and excludes the internal regions of the 
genome encoding the structural genes. As used herein, the "right terminal end" of the 
adenoviral genome refers to portion of the adenoviral genome which, when the 
genome is depicted schematically in linear form, is located at the extreme right end of 
the schematic. Typically, the right end of the adenoviral genome refers to be portion 
of the genome ending at map unit 36 and extending to the left to include at least the 3' 
ITRs, and excludes the internal regions of the genome encoding the structural genes. 
A. Adenovirus Regulatory Sequences 
1 • Serotype 

The selection of the adenovirus serotype donating its left 
terminal end and right terminal end can be readily made by one of skill in the art from 
among serotypes which can readily be cultured in the desired cell line. Among other 
factors which may be considered in selecting the serotype of the donating serotype is 
compatibility with the adenovirus serotype which will be supplying the internal 
regions at the location at which their sequences are hybridized. 

Suitable adenoviruses for donating their left and right termini 
are available from the American Type Culture Collection, Manassas, Virginia, US 
(ATCC), a variety of academic and commercial sources, or the desired regions of the 
donating adenoviruses may be synthesized using known techniques with reference to 
sequences published in the literature or available from databases (e.g., GenBank, 
etc.). Examples of suitable donating adenoviruses include, without limitation, human 
adenovirus serotypes 2, 3, 4, 5, 7, and 12, and further including any of the presently 
.dent.fied human types [see, e.g., Horwitz, "Adenoviridae and Their Replication" in 
Virology, 2d ed., pp. 1679-1721 (1990)] which can be cultured in the desired ceil 
Similarly adenoviruses known to infect non-human primates (e.g., chimpanzees, 
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rhesus, macaque, and other simian species) or other non-human mammals and which 
grow in the desired cell can be employed in the vector constructs of this invention. 
Such serotypes include, without limitation, chimpanzee adenoviruses Pan 5 [VR-591], 
Pan6 [VR-592], Pan? [VR-593], and C68 (Pan9), described in US Patent No. 
6,083,716; and simian adenoviruses including, without limitation SV1 [VR-195]- 
SV25 [SV.201]; SV35; SV15; SV-34; SV-36; SV-37, and baboon adenovirus [VR- 
275], among others. The sequences of Pan 5 (also termed C5), Pan 6 (also termed 
C6), Pan 7 (also termed C7), SV1, SV25, and SV39 have been described [WO 
03/046124, published 5 June 2003; and in US Patent Application No. 10/739,096, 
filed December 19, 2003)], which are incorporated by reference. In the following 
examples, the human 293 cells and adenovirus type 5 (Ad5), Pan9, and Ad40 are used 
for convenience. However, one of skill in the art will understand that other cell lines 
and/or comparable regions derived from other adenoviral strains may be readily 
selected and used in the present invention in the place of (or in combination with) 
these serotypes. 

2. Sequences 

The minimum sequences which must be supplied by the 
adenovirus donating its left terminal end and its right terminal end include the 5' cis- 
elements and the 3' cis-elements necessary for replication and packaging. Typically, 
the 5' cis-elements necessary for packaging and replication include the 5' inverted 
terminal repeat (ITR) sequences (which functions as origins of replication) and the 
native 5' packaging enhancer domains (that contain sequences necessary for 
packaging linear Ad genomes and enhancer elements for the El promoter). The right 
end of the adenoviral genome includes the 3' cis-elements (including the ITRs) 
necessary for packaging and encapsidation. Desirably, the adenovirus serotype 
donating its left and right termini and/or an adenovirus serotype which 
transcomplements the serotype of the donating adenovirus, further provides the 
functions of the necessary adenovirus early genes, including El (Ela and Elb) E2 
(E2a and E2b), and E4 (including at least the ORF6 region). E3 is not essentia! and 
may be deleted as desired, e.g., for insertion of a transgene in this region or to provide 
space for a transgene inserted in another region (typically for packaging it is desirable 
for the total adenoviral genome to be under 36 kb). 
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In certain embodiments, the necessary adenovirus early koos 
^~^««-* — moaretembodimet, ^ r 

O fonauy one or more of the necessary adenovira, sti-uctura, genes by 
me adenovnus donating iB , eft tt(mina| and jB rig|n Knn . na] 
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P VIII, pTP, 52/55 kDa protem, protein V „, M „ and/or protein VI from the 
adenoma aerobe ^ to , eft Md ^ ^ * 

ecrved from the adenovirus serotype donating the 5. and 3" ITRs, , 

elf :: rUS T? ^ d0na,i " 8 *» - 5 ' ™ the 

com JIT"" ,hePTPMnan< " heremb0 ' ,1 -«.*«Himericade„0 V i„s 
comams rhe teft end of the donating adenovirus serotvp. from lne 5 , 1TR TT „ 
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Ptotem ,„a protein, Suitab.y, titese genes express function* plfc 
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dtese protems (whether hybrid or non-hybrid, can be functional 
hrmerrc adenovirus. Where desired, any necessary proteins funLatiy e,«ed ,„ 
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B. Parental Adenovirus Structural Proteins 
1 . Serotypes 

This invention is particularly well adapted for use in 
generating chimeric adenoviruses in which the capsid proteins are from a parental 
5 adenovirus which does not efficiently grow in a desirable host cell. The selection of 
the parental adenovirus serotype providing the internal regions can be readily made 
by one of skill in the art based on the information provided herein. 

A variety of suitable adenoviruses can serve as a parental 
adenovirus supplying the regions encoding the structural (i.e., capsid proteins). Many 
10 of such adenoviruses can be obtained from the same sources as described above for 
the donating adenovirus serotypes. Examples of suitable parental adenovirus 
serotypes includes, without limitation, human adenovirus serotype 40, among others 
[see, e.g., Horwitz, "Adenoviridae and Their Replication", in Virology, 2d ed., pp. 
1679-1721 (1990)], and adenoviruses known to infect non-human primates (e.g.', 
15 chimpanzees, rhesus, macaque, and other simian species) or other non-human 

mammals, including, without limitation, chimpanzee adenovirus CI, described in US 
Patent No. 6,083,716; simian adenoviruses, and baboon adenoviruses, among others 
In addition, the parental adenovirus supplying the internal regions may be from a non- 
naturally occurring adenovirus serotype, such as may be generated using a variety of 
20 techniques known to those of skill in the art. 

In one embodiment illustrated herein, a chimeric virus that was constructed 
was that between the chimpanzee adenoviruses Pan-5 and CI exhibited a higher titer 
in human 293 cells than the wild-type parental virus. However, the invention is not 
hnuted to the use of these chimpanzee adenoviruses, or to the combination of simian- 
s.mian, human-human, or simian-human chimeric adenoviruses. For example it may 
be desirable to utilize bovine or canine adenoviruses, or other non-human mammalian 
adenov.ruses which do not naturally infect and/or replicate in human cells. 

In the following examples, the human adenovirus type 40 
(Ad40) and the chimpanzee adenovirus C 1 , simian Pan 5 and Ad40, and Pan 5 and 
simian adenovirus SA18, are used. However, one of skill in the art will understand 
that other adenoviral serotypes may be readily selected and used in the present 
invention in the place of (or in combination with) these serotypes. 
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2. Sequences 

The parental adenovirus provides to the chimeric construct of 
the invention its internal regions which includes structural proteins necessary for 
generating a capsid having the desired characteristics of the parental adenovirus. 
These desired characteristics include, but are not limited to, the ability to infect target 
cells and delivery a heterologous transgene, the ability to elude neutralizing 
antibodies directed to another adenovirus serotype (i.e., avoiding clearance due to 
cross-reactivity), and/or the ability to infect cells in the absence of an immune 
response to the chimeric adenovirus. The advantages of such characteristics may be 
most readily apparent in a regimen which involves repeat delivery of adenoviral 
vectors. The left and right termini of the parent adenovirus, including at least the 5' 
ITRs, the El region, the E4 region and the 3' ITRs are non-functional and, preferably 
completely absent. Optionally, all adenovirus regulatory proteins from this parental ' 
adenovirus are non-functional and only the structural proteins (or selected structural 
proteins) are retained. 

At a minimum, the parental adenovirus provides the 
adenoviral late region encoding the hexon protein. Suitably, the parental adenovirus 
further provides the late regions encoding the penton and the fiber. In certain 
embodiments, all of the functional adenoviral late regions, including LI (encoding 
52/55 Da, Ilia proteins), L2 (encoding penton, VII, V, Mu proteins), L3 (encoding VI, 
hexon, Endoprotease), L4 (encoding 100 kD, 33 kD, VIII proteins) and L5 (encoding' 
fiber protein) are supplied by the parental adenovirus. Optionally, one or more of 
these late gene functions, with the exception of those encoding the hexon, penton and 
fiber proteins, can be functionally deleted. Any necessary structural proteins may be 
supplied in trans. 

Thus, in certain embodiments, the chimeric adenovirus further 
contains one or more functional adenovirus genes, including, the Endoprotease open 
reading frame, DNA binding protein, 100 kDa scaffolding protein, 33 kDa protein 
protein VIII, pTP, 52/55 kDa protein, protein VII, Mu and/or protein VI from the ' 
parental adenovirus donating its internal regions. Optionally, one or more of the 
genes can be hybrids formed from the fusion of the donating adenovirus serotype and 
the parental adenovirus serotype providing the capsid proteins, as described above 
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C. The "Minigene" 

Typically, an adenoviral vector of the invention is designed to contain 
a minigene which may be inserted into the site of a partially deleted, fully deleted 
(absent), or disrupted adenoviral gene. For example, the minigene may be located in 
the site of such a functional El deletion or functional E3 deletion, or another suitable 
site. 

The methods employed for the selection of the transgene, the cloning 
and construction of the "minigene" and its insertion into the viral vector are within the 
skill in the art given the teachings provided herein. 

1. The transgene 

The transgene is a nucleic acid sequence, heterologous to the 
vector sequences flanking the transgene, which encodes a polypeptide, protein or 
other product, of interest. The nucleic acid coding sequence is operatively linked to 
regulatory components in a manner which permits transgene transcription, translation, 
and/or expression in a host cell. 

The composition of the transgene sequence will depend upon 
the use to which the adenoviral vector will be put. For example, the adenoviral vector 
may be used as a helper virus in production of recombinant adeno-associated viruses 
or ,n production of recombinant adenoviruses deleted of essential adenoviral gene 
functions which are supplied by the adenoviral vector, or for a variety of production 
uses. Alternatively, the adenoviral vector may be used for diagnostic purposes. 

One type of transgene sequence includes a reporter sequence 
which upon expression produces a detectable signal. Such reporter sequences 
include, without limitation, DNA sequences encoding ^lactamase, P-galactosidase 
(LacZ), alkaline phosphatase, thymidine kinase, green fluorescent protein (GFP) 
chloramphenicol acetyltransferase (CAT), luciferase, membrane bound proteins ' 
■ncluding, for example, CD2, CD4, CDS, the influenza hemagglutinin protein, and 
others well known in the art, to which high affinity antibodies directed thereto exist or 
can be produced by conventional means, and fusion proteins comprising a membrane 
bound protein appropriately fused to an antigen tag domain from, among others 
hemagglutinin or Myc. These coding sequences, when associated with regulatory 
elements which drive their expression, provide signals detectable by conventional 
means, including enzymatic, radiographic, colorimetric, fluorescence or other 



11 



10 



15 



20 



25 



30 



spectrographic assays, fluorescent activating cell sorting assays and immunological 
assays, including enzyme linked immunosorbent assay (ELISA), radioimmunoassay 
(RIA) and .mmunohistochemistry. For example, where the marker sequence is the 
LacZ gene, the presence of the vector carrying the signal is detected by assays for 
beta-galactosidase activity. Where the transgene is GFP or luciferase, the vector 
carrymg the signal may be measured visually by color or light production in a 
luminometer. 

However, desirably, the transgene is a non-marker sequence 
encod.ng a product which is useful in biology and medicine, such as proteins 
pepudes, RNA, enzymes, or catalytic RNAs. Desirable RNA molecules include 
tRNA, dsRNA, ribosomai RNA, si RNAs, sma.1 hairpin RNAs, trans-splicing RNAs 
catalytic RNAs, and antisense RNAs. One example of a useful RNA sequence is a ' 
sequence which extinguishes expression of a targeted nucleic acid sequence in the 
treated animal. 

™ e trans8ene m 'y be used ** treatment, e.g., of genetic 
decencies, as a cancer therapeutic or vaccina, for induction of an immnne response 
and/or f„, prophyUctie vaccine purpose, As used herein, induction of an immune ' 
response rafars «o the abUi«y of a mote* (a.g.. a gana product, to induce a T call 
and/or a humoral i mmu „e response to the molecule. The invention further inciudes 
usntg multiple nansganes, e.g.. to corrac, or ameliorate a condition caused hy a multi- 
subun,, p roKi „. ,„ cmai „ situations> , differeM amsgmc ^ ^ ^ ^ ^ 

each subuni, of a protein, or to encode different pe ptides or proteins This „ 
when the s,ze of the DNA encoding the protein subunit is huge, e g for an 
■mmtmoglobulin, the platele-derived grow* factor, ora dystrophin protei, ,„ order 
for the cell to prcduce the ra „ W . su bu„i, protein, a cell is infected with the 
recontbnan, virus containing each of the different subunits. Altenuttively, different 
subuntts of a protein may be encoded by the same transgene. In this case, a si„g,e 
transgene includes the DNA encoding each of the subunits, with the DNA for each 
subun,, separated by an internal ribozyme entry site (I RES). This is durable when 
■he s 12 e ofUte DNA encoding each of the subunits is small, e . g „ the ^ siK ^ 
DNA encodmg the s „ buniB ta 1R£S „ ^ ^ flve k 

to an IRES, the DNA may be separate, by stances encoding a 2A peptide, whi* 
se.f-cleaves ,„ a post-translation., even, See, e.g., MX. DonneHy, « „/, J. Oen. 
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Virol., 78(Pt 1):13-21 (Jan 1997); Purler, S., et al, Gene Ther., 8(1 1):864-873 (June 
2001); Klump H., et al., Gene Ther., 8(10):81 1-817 (May 2001). This 2A peptide is 
significantly smaller than an IRES, making it well suited for use when space is a 
limiting factor. However, the selected transgene may encode any biologically active 
product or other product, e.g., a product desirable for study. 

Suitable transgenes may be readily selected by one of skill in 
the art. The selection of the transgene is not considered to be a limitation of this 
invention. 

2. Vector and Transgene Regulatory Elements 

In addition to the major elements identified above for the 
minigene, the adenoviral vector also includes conventional control elements which 
are operably linked to the transgene in a manner that permits its transcription, 
translation and/or expression in a cell transfected with the plasmid vector or infected 
with the virus produced by the invention. As used herein, "operably linked" 
sequences include both expression control sequences that are contiguous with the 
gene of interest and expression control sequences that act in trans or at a distance to 
control the gene of interest. 

Expression control sequences include appropriate transcription 
initiation, termination, promoter and enhancer sequences; efficient RNA processing 
signals such as splicing and polyadenylation (polyA) signals; sequences that stabilize 
cytoplasmic mRNA; sequences that enhance translation efficiency (i.e., Kozak 
consensus sequence); sequences that enhance protein stability; and whin desired, 
sequences that enhance secretion of the encoded product. A great number of 
expression control sequences, including promoters which are native, constitutive, 
inducible and/or tissue-specific, are known in the art and may be utilized. 

Examples of constitutive promoters include, without 
limitation, the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with 
the RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV 
enhancer) [see, e.g., Boshart et al, Cell, 41:521-530 (1985)], the SV40 promoter the 
d.hydrofo.ate reductase promoter, the P -actin promoter, the phosphog.ycerol kinase 
(PGK) promoter, and the EFlct promoter [Invitrogen]. 

Inducible promoters allow regulation of gene expression and 
can be regulated by exogenously supplied compounds, environmental factors such as 
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temperature, or the presence of a specific physiological state, e.g., acute phase a 
part,cular differentiation state of the cell, or in replicating cells only. Inducible 
promoters and inducible systems are available from a variety of commercial sources 
including, without limitation, Invitrogen, Clontech and Ariad. Many other systems ' 
have been described and can be readily selected by one of skill in the art For 
example, inducible promoters include the zinc-inducible sheep methionine (MT) 
promoter and the dexamethasone (Dex)-inducible mouse mammary tumor virus 
(MMTV) promoter. Other inducible systems include the T7 polymerase promoter 
system [WO 98/10088]; the ecdysone insect promoter [No et al, Proc. Natl Acad 
Sci. USA, 93:3346-3351 (1996)], the tetracycline-repressible system [Gossen et al 
Proc. Natl. Acad. Sci. USA, 89:5547-5551 (1992)], the tetracycline-inducible system 
[Gossen etal, Science, 268:1766-1769 (1995), see also Harvey etal, Curr. Opin 
Chem. Biol., 2:512-518 (1998)]. Other systems include the FK506 dimer, VP16 or 
P65 using castradiol, diphenol murislerone, the RU486- inducible system [Wang et al 
Nat. Biotech., 15:239-243 (1997) and Wang etal, Gene The,, 4:432-44. (1997)] and 
the rapamycin-inducible system [Magari et al, J. Clin. Invest., 100:2865-2872 
(1997)]. The effectiveness of some inducible promoters increases over time In such 
cases one can enhance the effectiveness of such systems by inserting multiple 
repressors in tandem, e.g., TetR linked to a TetR by an IRES. Alternatively, one can 
wa.t at least 3 days before screening for the desired function. One can enhance 
express,on of desired proteins by known means to enhance the effectiveness of this 
system. For example, using the Woodchuck Hepatitis Virus Posttranscriptional 
Regulatory Element (WPRE). 

In another embodiment, the native promoter for the transgene 
will be used. The native promoter may be preferred when it is desired that expression 
of the transgene should mimic the native expression. The native promoter may be 
used when expression of the transgene must be regulated temporally or 
developmental^ or in a tissue-specific manner, or in response to specific 
transcriptional stimuli. In a further embodiment, other native expression control 
elements, such as enhancer elements, polyadenylation sites or Kozak consensus 
sequences may also be used to mimic the native expression. 

Another embodiment of the transgene includes a transgene 
operably linked to a tissue-specific promoter. For instance, if expression in skeletal 
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muscle is desired, a promoter active in muscle should be used. These include the 
promoters from genes encoding skeletal p-actin, myosin light chain 2A, dystrophin, 
muscle creatine kinase, as well as synthetic muscle promoters with activities higher 
than naturally occurring promoters (see Li et al, Nat. Biotech., 17:241-245 (1999)). 
Examples of promoters that are tissue-specific are known for liver (albumin, 
Miyatake et al, J. Virol., 71:5124-32 (1997); hepatitis B virus core promoter, Sandig 
et al, Gene Ther., 3:1002-9 (1996); alpha-fetoprotein (AFP), Arbuthnot etai, Hum. 
Gene Ther., 7:1503-14 (1996)), bone osteocalcin (Stein etai, Mol. Biol. Rep., 
24:185-96 (1997)); bone sialoprotein (Chen etai, J. Bone Miner. Res., 11:654-64 
(1996)), lymphocytes (CD2, Hansal etai, J. Immunol., 161:1063-8 (1998); 
immunoglobulin heavy chain; T cell receptor chain), neuronal such as neuron-specific 
enolase (NSE) promoter (Andersen et al, Cell. Mol. Neurobiol, 13:503-1 5 (1993)), 
neurofilament light-chain gene (Piccioli etai, Proc. Natl. Acad. Sci. USA, 88:561 1-5 
(1991)), and the neuron-specific vgf gene (Piccioli etai, Neuron, 15:373-84 (1995)), 
among others. 

Optionally, vectors carrying transgenes encoding 
therapeutically useful or immunogenic products may also include selectable markers 
or reporter genes may include sequences encoding geneticin, hygromicin or 
purimycin resistance, among others. Such selectable reporters or marker genes 
(preferably located outside the viral genome to be packaged into a viral particle) can 
be used to signal the presence of the plasmids in bacterial cells, such as ampicillin 
resistance. Other components of the vector may include an origin of replication. 
Selection of these and other promoters and vector elements are conventional and 
many such sequences are available [see, e.g., Sambrook et al, and references cited 
therein]. 

These vectors are generated using the techniques and 
sequences provided herein, in conjunction with techniques known to those of skill in 
the art. Such techniques include conventional cloning techniques of cDNA such as 
those described in texts [Sambrook et al, Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Press, Cold Spring Harbor, NY], use of overlapping 
oligonucleotide sequences of the adenovirus genomes, polymerase chain reaction, and 
any suitable method which provides the desired nucleotide sequence. 
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». Production of the Recombinant Viral Particle 

In one embodiment, the invention provides a method of generating 
recombinant chimeric adenoviral particles in which the capsid of the chimeric 
adenovirus is of a serotype incapable of efficient growth in the selected host cell A 
vector suitable for production of recombinant chimeric adenoviral particles can be 
generated by direct cloning. Alternatively, such particles can be generated by 
homologous recombination between a first vector containing the left end of the 
ch.meric adenoviral genome and a second vector containing the right end of the 
ch.meric adenoviral genome. However, any suitable methodology known to those of 
sk.ll m the art can be readily utilized to generate a vector suitable to generate a 
production vector, preferably which contains the entire chimeric adenoviral genome 
mcludmg a minigene. This production vector is then introduced into a host cell in ' 
wh,ch the adenoviral capsid protein is assembled and the chimeric adenoviral particle 
assembled as described. 

The chimeric adenoviruses of the invention include those in which one or 
more adenoviral genes are absent, or otherwise rendered non-functional. If any of the 
rn.ss.ng gene functions are essentia, to the replication and infectivity of the adenoviral 
part.de, these functions are supplied by a complementation (or transcomplementing) 
cell hne or a helper vector expressing these functions during production of the 
chimeric adenoviral particle. 

Examples of chimeric adenoviruses containing such missing adenoviral gene 
functions include those which are partially or completely deleted in the Ela and/or 
Elb gene. In such a case, the EI gene functions can be supplied by the packaging 
host cel., permitting the chimeric construct to be deleted of El gene functions and if 
des.red, for a transgene to be inserted in this region. Optionally, the El gene can be 
of a serotype which transcomplements the serotype providing the other adenovirus 
sequences in order to further reduce the possibility of recombination and improve 
safety. In other embodiments, it is desirable to retain an intact Ela and/or Elb region 
m the recombinant adenoviruses. Such an intact El region may be located in its 
nat.ve location in the adenoviral genome or placed in the site of a deletion in the 
native adenoviral genome (e.g., in the E3 region). 

In another example, all or a portion of the adenovirus delayed early gene E3 
may be eliminated from the chimeric adenovirus. The function of adenovirus E3 is 
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believed to be irrelevant to the function and production of the recombinant virus 
part.de. Chimeric adenovirus vectors may also be constructed having a deletion of at 
least the ORF6 region of the E4 gene, and more desirably because of the redundancy 
m the function of this region, the entire E4 region. Still another vector of this 
invention contains a deletion in the delayed early gene E2a. Similarly, deletions in 
the mtermediate genes IX and IVa 2 may be useful for some purposes. Optionally 
deletions may also be made in selected portions of the late genes LI through L5, as 
described above. 

Other deletions may be made in the other structural or non-structural 
adenovirus genes. The above-discussed deletions may be used individually ie an 
adenovirus sequence for use in the present invention may contain deletions in only a 
smgle region. Alternatively, deletions of entire genes or portions thereof effective to 
destroy their biological activity may be used in any combination. For example in one 
exemplary vector, the adenovirus sequence may have deletions of the El genes and 
the E4 gene, or of the El, E2a and E3 genes, oroftheEl and E3 genes, or of El E2a 
and E4 genes, with or without deletion of E3, and so on. As discussed above, such 
deletes may be used in combination with other mutations, such as temperature- 
sensitive mutations, to achieve a desired result. 

Examples of suitable transcomplementing serotypes are provided above The 
use of transcomplementing serotypes can be particularly advantageous where there is 
d.versity between the Ad sequences in the vector of the invention and the human 
AdEl sequences found in currently available packaging cells. In such cases, the use 
of the current human El-containing cells prevents the generation of replication- 
competent adenoviruses during the replication and production process. However in 
certam circumstances, it will be desirable to utilize a cell line which expresses the El 
gene products can be utilized for production of an El -deleted simian adenovirus 
Such cell lines have been described. See, e.g., US Patent 6,083,716. 
A. Packaging Host Cells 

Suitably, the packaging host cell is selected from among cells in 
wh,ch the adenovirus serotype donating the left and right terminal ends of the 
chimeric genome are capable of efficient growth. The host cells are preferably of 
mammalian origin, and most preferably are of non-human primate or human origin 
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Particularly desirable host cells are selected from among any 
mammalian species, including, without limitation, cells such as A549 [ATCC 
Accession No. CCL 185], 91 1 cells, WEHI, 3T3, 10T1/2, HEK 293 cells or PERC6 
(both of which express functional adenoviral El) [Fallaux, FJ etal, (1998) Hum 
Gene Ther, 9:1909-1917], Saos, C2C12, L cells, HT1080, HepG2, HeLa [ATCC 
Accession No. CCL 2], KB [CCL 17], Detroit [e.g., Detroit 5.0, CCL 72] and Wl-38 
[CCL 75] cells, and primary fibroblast, hepatocyte and myoblast cells derived from 
mammals including human, monkey, mouse, rat, rabbit, and hamster. These cell lines 
are all available from the American Type Culture Collection, 10801 University 
Boulevard, Manassas, Virginia 20110-2209. Other suitable cell lines may be 
obtained from other sources. The selection of the mammalian species providing the 
cells ,s not a limitation of this invention; nor is the type of mammalian cell, i.e., 
fibroblast, hepatocyte, tumor cell, etc. 

As described above, a chimeric adenovirus of the invention can lack 
one or more functional adenoviral regulatory and/or structural genes which are 
supplied either by the host cell or in trans to effect packaging of the chimeric 
adenovn-us into the viral capsid to generate the viral particle. Thus, the ability of a 
selected host cell to supply transcomplementing adenoviral sequences may be taken 
into consideration in selecting a desired host cell. 

In one example, the cells are from a stable cell line which expresses 
adenovirus Ela and El b functions from a cell line which transcomplements the 
adenovirus serotype which donates the left and right termini to the chimera of the 
invent™, permitting the chimera to be El-deleted. Alternatively, where the cell line 
does not transcomplement the adenovirus donating the termini, El functions may be 
provided by the chimera, or in trans. 

If desired, one may utilize the sequences provided herein to generate a 
packaging cel. or cell line that expresses, at a minimum, the adenovirus El gene from 
the adenovirus serotype donating the 5' ITR under the transcriptional control of a 
promoter for expression, or a transcomplementing serotype, in a selected parent cell 
line. Inducble or constitutive promoters may be employed for this purpose 
Examples of such promoters are described in detail elsewhere in this specification A 
parent cell is selected for the generation of a novel cell line expressing any desired 
adenovirus or adenovirus gene, including, e.g., a human Ad5, AdPanS, Pan6, Pan7 
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SV1 . SV25 or SV39 gene. Without limitation, such a parent cell line may be HeLa 
[ATCC Accession No. CCL 2], A549 [ATCC Accession No. CCL 185], HEK 293 
KB [CCL 17], Detroit [e.g., Detroit 510, CCL 72] and WI-38 [CCL 75] cells, among 
others. Many of these cel. lines are all avai.able from the ATCC. Other suitable 
parent cell lines may be obtained from other sources. 

Such El -expressing cell lines are useful in the generation of chimeric 
adenovirus El deleted vectors. Additionally, or alternatively, the invention provides 
cell hues that express one or more simian adenoviral gene products, eg Ela Elb 
E2a, and/or E4 ORF6, can be constructed using essential.y the same procedures for 
use ,n the generation of chimeric viral vectors. Such cell lines can be utilized to 
transcomplement adenovirus vectors deleted in the essential genes that encode those 
products, or to provide helper functions necessary for packaging of a helper- 
dependent virus (e.g, adeno-associated virus). The preparation of a host cell 
according to this invention involves techniques such as assembly of selected DNA 
sequences. This assembly may be accomplished utilizing conventional techniques 
Such techniques include cDNA and genomic Coning, which are well known and are 
described in Sambrook et a.., cited above, use of overlapping oligonucleotide 
sequences of the adenovirus genomes, combined with polymerase chain reaction 
synthetic methods, and any other suitable methods which provide the desired 
nucleotide sequence. 

In still another alternative, the essential adenoviral gene products are 
pmvKfcd in trans by the adenoviral vector and/or helper virus. In such an instance a 
suable host. cel. can be selected from any biological organism, including prokaryotic 
(e.g., bacterial) cells, and eukaryotic cells, including, insect cells, yeast cells and 
mammalian cells. Particularly desirable host cells are selected from among any 
mammalian species, including, without limitation, cells such as A549 WEHI 3T3 
1 0T1/2, HEK 293 cells or PERC6 (both of which express functional adenoviral El) 
[Fallaux, FJ et al, (1998), Hum Gene Ther, 9:1909-1917], Saos, C2C12 L cells 
HT1080, HepG2 and primary fibroblast, hepatocyte and myoblast cells derived from 
mammals including human, monkey, mouse, rat, rabbit, and hamster. The selection 
of the mammalian species providing the cells is not a limitation of this invention- nor 
•s the type of mammalian cell, i.e., fibroblast, hepatocyte, tumor cell, etc. 
B. Helper Vectors 
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Thus, depending upon the adenovirus gene content of the adenoviral 
vectors and any adenoviral gene functions expressed from the host cell, a helper 
vector may be necessary to provide sufficient adenovirus gene sequences necessary to 
produce an infective recombinant viral particle containing the minigene. See, for 
example, the techniques described for preparation of a "minimal" human Ad vector in 
International Patent Application W096/13597, published May 9, 1996, and 
incorporated herein by reference. Suitably, these helper vectors may be non- 
replicating genetic elements, a plasmid, or a virus. 

Useful helper vectors contain selected adenovirus gene sequences not 
present in the adenovirus vector construct and/or not expressed by the packaging cell 
line in which the vector is transfected. In one embodiment, the helper virus is 
replication-defective and contains a variety of adenovirus genes in addition to the 
sequences described above. Such a helper vector is desirably used in combination 
with an El -expressing cell line. 

Helper vectors may be formed into poly-cation conjugates as 
described in Wu etal,J. Biol. Chen,., 264:16985-16987 (1989); K. J. Fisher and J M 
Wilson, Biochem. J., 299:49 (April 1, 1994). A helper vector may optionally contain 
a second reporter minigene. A number of such reporter genes are known to the art. 
The presence of a reporter gene on the helper virus which is different from the 
transgene on the adenovirus vector allows both the Ad vector and the helper vector to 
be mdependently monitored. This second reporter is used to enable separation 
between the resulting recombinant virus and the helper virus upon purification. 
C. Assembly of Viral Particle and Transfection of a Cell Line 
Generally, when delivering the vector comprising the minigene by 
transfection, the vector is delivered in an amount from about 5 ug to about 100 ug 
DNA, and preferably about 10 to about 50 ug DNA to about 1 x 10< cells to about 1 x 
10" cells, and preferably about 10 5 cells. However, the relative amounts of vector 
DNA to host cells may be adjusted, taking into consideration such factors as the 
selected vector, the delivery method and the host cells selected. 

Introduction into the host cell of the vector may be achieved by any 
means known in the art or as disclosed above, including transfection, and infection 
One or more of the adenoviral genes may be stably integrated into the genome of the 
host cel., stably expressed as episomes, or expressed transiently. The gene products 
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may all be expressed .ransiendy, on „ episorac „ ^ ^ ^ 

gene produce may be expressed stably while others are expressed transiendy. 

Furthermore, the promoters f or each of the adenoviral genes may be 
selected independently from a eonstitotive p rom „,er, an inducible promoter or , 
nattve adenoviral prom oter. The promoters may be regulated by a specific 
Physiological state of the organism or eel, (i.e., by the differentiation . or in 
replying or quiescent cells) o, by exogenous* added factors, f or example 

Introduction of the molecules (as p lasmids or vinBes) Mo ^ hoM 
cel. may also be accomplish* „s tog , echmqoes ^ ,„ fc ^ ^ ^ 
dtscussed throughout me specification. In preferred embodiment, standard 
•ransfection techniques are used, e.g., CaPO. .ransfection or elecux>po Mt i„„ 

Assembly of the selected DNA sequences of the adenovirus (as well 
as the transgene and other vector elements) inR> various iMemKdiaK p 
*= use plasmids and vectors to produce a recombinant viral particle are a.l 
achteved using conventional techniques. Such techniques include direc, cloning as 
described [G. Gao m a,, Gene Ther. 2003 Octi 1<K.22): 19 2„.,930; US Paten, 
fttblicationNo. 2003-0092161 -A, published May .5, 2003; International Paten. 
ApphcationNo. PCMJS03/12405). Cher cloning techniques of cDNA such as 
those described in texts [Sambrook e, al, cited above], use of overlapping 
oligonucleotide sequences of ft. adenovirus genomes, polymerase chain reaction, and 
any suitable method which provides .he desired nucleotide sequence on be utilized 
Standard .ransfection and co-transfection techniques ara employed, e. s CaP0 4 
preciphation techniques. Other conventiona, medtods employed include hotnologous 
tecombmation of Ute vira, genomes, ptoquing of viruses in agar over|ay 
measuring signal generation, and the like. 

For example, following .he construction and assembly of the desired 
mmigene-containing viral vector, the vector is transfected ,„ vitn, in lhe presence of 
an opnonal h=,per vector into the packaging cel, line. The fitnetiona exposed Snm 
the plasmid, packaging cell line and helper vinut, if any, pe™* me a<Wirus . 
transgene sequences in the vector to be replicated a* packaged into virion capsids 
resulting in the chimeric viral particles. The current method for producing such vir'us 
particles is .ransfec.ion-bt.ed. However, me invention is no, .imited .„ such 
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method, The resulting chimeric adenoviruses are use*, in transferring a selected 
transgene to a selected cell. 

Ill- Use of the Chimeric Adenovirus Vectors 

The chimeric adenovirus vectors of the invention are useful for gene transfer 
to a human or veterinary subject (including, non-human primates, non-simian 
primates, and other mammals) in vitro, ex vivo, and in vivo. 

The recombinant adenovirus vectors described herein can be used as 
expression vectors for the production of the products encoded by the heterologous 
genes Unvitro. For example, the recombinant adenoviruses containing a gene inserted 
mto the location of an El deletion may be transfected into an El-expressing cell line 
as described above. Alternatively, rep.ication-competent adenoviruses may be used 
m another selected cell line. The transfected cells are then cultured in the 
conventional manner, allowing the recombinant adenovirus to express the gene 
produ Ct from the promoter. The gene product may then be recovered from the culture 
med.um by known conventional methods of protein isolation and reC overy from 
culture. 

A ehimerie adenoviral vector of the invention provides an efficient gen. 
transfer vehicle that can de.iver a selected transgene to a selected hos, cell ,n v™ or 
«t v/vo even where the organism has neutralizing antibodies to one or more AAV 
serotype, ,„ one embodiment, the rAd and the cells are mixed e, Wvo; me infected 
cells are cultured using convention,! methodologies; and the tntnsduced cells are re- 
utfttsed into the patient These impositions are particularly „ el | suited to gene 
deltvery for therapeutic purposes and for immunization, including inducing protective 
immunity. 

More commonly, the chimeric adenoviral vectors of the invention will be 
ut-hzed for delivery of therapeutic or immunogenic molecules, as described below It 
w,.l be readily understood for both applications that the recombinant adenoviral 
vectors of the invention are particularly well suited for use in regimens involving 
repeat delivery of recombinant adenoviral vector, Such regimens typically involve 
dehvery of a series of viral vectors in which the viral capsids are alternated. The viral 
capsids may be changed for each subsequent administration, or after a pre-selected 
number of administrations of a particular serotype capsid (e.g., one, two, three, four 
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or more). Thus, a regimen may involve delivery of a rAd with a first capsid, delivery 
with a rAd with a second capsid, and delivery with a third capsid. A variety of other 
regimens which use the Ad capsids of the invention alone, in combination with one 
another, or in combination with other Ad serotypes will be apparent to those of skill 
m the art. Optionally, such a regimen may involve administration of rAd with capsids 
of non-human primate adenoviruses, human adenoviruses, or artificial (e.g., chimeric) 
serotypes such as are described herein. Each phase of the regimen may involve 
administration of a series of injections (or other delivery routes) with a single Ad 
serotype capsid followed by a series with another Ad serotype capsid. Alternatively 
the recombinant Ad vectors of the invention may be utilized in regimens involving 
other non-adenoviral-mediated delivery systems, including other viral systems non- 
v,ral delivery systems, protein, peptides, and other biologically active molecules. 

The following sections will focus on exemplary molecules which may be 
delivered via the adenoviral vectors of the invention. 

A. Ad-Mediated Delivery of Therapeutic Molecules 

In one embodiment, the Ad vectors described herein are administered 
to humans according to published methods for gene therapy. A viral vector of the 
invention bearing the selected transgene may be administered to a patient, preferably 
suspended in a biologically compatible solution or pharmaceutical ly acceptable 
delivery vehicle. A suitable vehicle includes sterile saline. Other aqueous and non- 
aqueous isotonic sterile injection solutions and aqueous and non-aqueous sterile 
suspensions known to be pharmaceutical ly acceptable carriers and well known to 
those of skill in the art may be employed for this purpose. 

The adenoviral vectors are administered in sufficient amounts to 
transduce the target cells and to provide sufficient levels of gene transfer and 
expression to provide a therapeutic benefit without undue adverse or with medically 
acceptable physiological effects, which can be determined by those skilled in the 
medical arts. Conventional and pharmaceutical* acceptable routes of administration 
•nclude, but are not limited to, direct delivery to the retina and other intraocular 
delivery methods, direct delivery to the liver, inhalation, intranasal, intravenous, 
intramuscular, intratracheal, subcutaneous, intradermal, rectal, oral and other 
parenteral routes of administration. Routes of administration may be combined, if 
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desired, or adjusted depending upon the transgene or the condition. The route of 
ad m ,n,str a tion primarily will depend on the nature of the condition being treated. 

Dosages of the viral vector will depend primarily on factors such as 
the condition being treated, the age, weight and health of the patient, and may thus 
vary among patients. For example, a therapeutically effective adult human or 
veterinary dosage of the viral vector is genera.ly in the range of from about 100 uL to 
about 100 mL of a carrier containing concentrations of from about 1 x 10 6 to about 1 
x 10 particles, about 1 x 10" to 1 x 10» particles, or about 1 x 10* to Ix 10* 
part.cles. Dosages will range depending upon the size of the animal and the route of 
administration. For example, a suitable human or veterinary dosage (for about an 80 
kg ammal) for intramuscular injection is in the range of about 1 x 10* to about 5 x 
10 particles per mL, for a single site. Optionally, multiple sites of administration 
may be delivered. In another example, a suitable human or veterinary dosage may be 
■n the range of about 1 x 10" to about 1 x 10" particles for an oral formulation. One 
of skdl ,n the art may adjust these doses, depending the route of administration, and 
the therapeutic or vaccinal application for which the recombinant vector is employed 
The levels of expression of the transgene, or for an immunogen, the level of 
crculating antibody, can be monitored to determine the frequency of dosage 
administration. Yet other methods for determining the timing of frequency of 
adm.nistration will be readily apparent to one of skill in the art. 

An optional method step involves the co-administration to the 
panent, either concurrently with, or before or after administration of the viral vector 
of a suitable amount of a short acting immune modulator. The selected immune 
modulator is defined herein as an agent capable of inhibiting the formation of 
neutralizing antibodies directed against the recombinant vector of this invention or 
capable of inhibiting cytolytic T lymphocyte (CTL) elimination of the vector The 
■mmune modulator may interfere with the interactions between the T helper subsets 
(T H , or Th2 ) and B cells to inhibit neutralizing antibody formation. Alternatively the 
.mmune modulator may inhibit the interaction between T H , cells and CTLs to reduce 
the occurrence of CTL elimination of the vector. A variety of useful immune 
modulators and dosages for use of same are disclosed, for example, in Yang et al J 
V.rol, 70(9) (Sept 1996); International Patent Application No. WO96/12406 
published May 2, 1996; and International Patent Application No.PCT/US96/03035 
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all incorporated herein by reference. Typically, such immune modulators would be 
selected when the transgene is a therapeutic which requires repeat delivery. 
1. Therapeutic Transgenes 

Useful therapeutic products encoded by the transgene include 
hormones and growth and differentiation factors including, without limitation, insulin 
glucagon, growth hormone (GH), parathyroid hormone (PTH), growth hormone ' 
releasing factor (GRF), follicle stimulating hormone (FSH), luteinizing hormone 
human chorionic gonadotropin (hCG), vascular endothelial growth factor 
(VEGF), angiopoietins, angiostatin, granulocyte colony stimulating factor (GCSF) 
erythropoietin (EPO), connective tissue growth factor (CTGF), basic fibroblast ' 
growth factor (bFGF), acidic fibroblast growth factor (aFGF), epidermal growth 
factor (EGF), transforming growth factor a (TGF a), platelet-derived growth factor 
(PDGF), msulin growth factors I and II (IGF-I and IGF-II), any one of the 
transforming growth factor superfamily, including TGF, activins, inhibins, or any of 
the bone morphogenic proteins (BMP) BMPs 1-15, any one of the 
heregluin/neuregulin/ARIA/neu differentiation factor (NDF) family of growth factors 
nerve ^ factor (NGF), brain-derived neurotrophic factor (BDNF), neurotrophic' 
NT-3 and NT-4/5, diary neurotrophic factor (CNTF), glial cell line derived 
neurotrophic factor (GDNF), neurturin, agrin, any one of the family of 
semaphorins/collapsins, netrin-1 and netrin-2, hepatocyte growth factor (HGF) 
ephnns, noggin, sonic hedgehog and tyrosine hydroxylase. 

Other useful transgene products include proteins that regulate the 
immune system including, without limitation, cytokines and lymphokines such as 
thrombopoietin (TPO), inter.eukins (IL) IL-1 through IL-25 (including, e g IL-2 IL 
4, IL-.2 and IL-18), monocyte chemoattractant protein, leukemia inhibitory factor 
granulocyte-macrophage colony stimu.ating factor, Fas ligand, tumor necrosis factors 
and, mterferons, and, stem cell factor, flk-2/fh3 ligand. Gene products produced by 
the unmune system are also useful in the invention. These include, without 
l.m.tation, immunoglobulins IgG, IgM, IgA, IgD and IgE, chimeric immunog.obulins 
human,zed antibodies, single chain antibodies, T cel. receptors, chimeric T cel. ' 
receptors, sing.e chain T cel. receptors, class I and class II MHC molecules, as we., as 
engmeered immunog.obu.ins and MHC mo.ecu.es. Useful gene products also include 
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consent regulatory proteins such as complement regulatory proteins, membrane 
cofactor prote.n (MCP), decay accelerating factor (DAF), CR1, CF2 and CD59 

Still other useful gene products include any one of the receptors for 
the hormones, growth factors, cytokines, lymphokines, regulatory proteins and 
■mmune system proteins. The invention encompasses receptors for cholesterol 
regulation, including the low density lipoprotein (LDL) receptor, high density 
lipoprotein (HDL) receptor, the very low density lipoprotein (VLDL) receptor 
proteins useful in the regulation of lipids, including, e.g., apoprotein (apo)A and 
•ts .soforms (e.g., ApoAI), apoE and its isoforms including E2, E3 and E4), SRB1 
ABCl, and the scavenger receptor. The invention also encompasses gene products 
such as members of the steroid hormone receptor superfamily including 
glucocorticoid receptors and estrogen receptors, Vitamin D receptors and other 
nuclear receptors. In addition, useful gene products include transcription factor, such 
as jun, fos, max, mad, serum response factor (SRF), AP-1, AP2, myb, MyoD and 
myogenin, ETS-box containing proteins, TFE3, E2F, ATFI, ATF2 ATF3 ATF4 
ZF5 NFAT, CREB, HNF-4, C/EBP, SP1, CCAAT-box binding proteins, interferon 
regu afon factor (IRF-1), Wilms tumor protein, ETS-binding protein, STAT, GATA- 
box bmdmg proteins, e.g., GATA-3, and the forkhead family of winged helix 
proteins. 

Other useful gene products include, carbamoyl synthetase I, ornithine 
transcarbamylase, arginosuccinate synthetase, arginosuccinate lyase, arginase 
fumarylacetacetate hydrolase, phenylalanine hydroxylase, alpha-1 antitrypsin' 
glucose-6-phosphatase, porphobilinogen deaminase, cystathione beta-synthase 
branched chain ketoacid decarboxylase, albumin, isovaleryl-coA dehydrogenase 
prop Ion yl CoA carboxylase, methyl malonyl CoA mutase, glutaryl CoA 
dehydrogenase, insulin, beta-glucosidase, pyruvate carboxylate, hepatic 
Phosphorylase, phosphory.ase kinase, glycine decarboxylase, H-protein, T- P rotein a 
cyst.c fibrosis transmembrane regulator (CFTR) sequence, and a dystrophin cDNA 
sequence. Other useful gene products include those useful for treatment of 
hemophilia A (e.g., Factor VIII and its variants, including the light chain and heavy 
cham of the heterodimer, optionally operab.y linked by a junction), and the B-domain 
deleted Factor VIII, see US 6,200,560 and 6,221,349], and useful for treatment of 
hemophilia B (e.g. Factor IX). 
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Still other useful gene prafucls mc , u(te n0 „. nalura| , y oc 
polypeptides, such as chimeric or hybrid polypeptides having a no„.„a,ura lly 
occurring amino acid sequence containing insertions, deletions or amino acid 
substitutions. For example, si„gl«hain engineered immunoglobulins could be useful 
m certain ,mmun„com P romised patient. Ofher types of non-naturally occurring gene 
sequences include an.isense molecu.es and cafalvtic nucleic acids, such as ribozymes 
which could be used to reduce overexpression of, uuget. 

Reducdon and/or modulation of expression of a gene an, particularly 
destrable for treatment of hyperproliferanve conditions chamcterized by 
hyperprolifemting cells, as arc cancers and psoriasis. Targe, polypeptides include 
those polypeptides which are produced exclusively or at higher levels in 
hypen.rolifera.ive cells as compared ,o normal colls. Targe, antigens include 
polypeptide, encoded by oncogenes such as myb, myc, fyn, and the t^cc^on 
gene bcr/abl, ras, src, P53, „eu, „k and EORF. In addition to oncogene products as 
targe, antigens, targe, polypeptides for anti-cancor treatments and protective mgimens 
mclude variable regions of antibodies made by B cell lymphomas and variable 
reg.ons of T cel, receptors of T eel, lymphomas which, in some embodiments, are 
also used as „rge. antigens f„, autoimmune disease. Other ft,mor-associ.,ed 
polypeptides can be used as Urge, polypeptides such as po.ypeptides which am f ou „ d 
a, higher levels in mmor cells including me polypeptide mcognized by monoclonal 
antibody 17-1A and folate binding polypeptides. 

Other suitable merapeutic polypeptides and proteins include those 
which may be useful recreating individuals suffering fiom au.oimmu„e diseases and 
didders by conferring a broad based protective immune response against targets tha, 
are assorted wi,h autoimmune including cell recepmrs and cells which produce 
self-directed antibodies. T-cell mediate* autoimmune disease, include rheumatoid 
arthritis <RA), multiple sclerosis (MS), Sjdgmn's syndmme, sarcoidosis, insulin 
dependent diabetes mel.ites (1DDM), autoimmune thyroiditis, reactive arthritis 
ankylosing spondylitis, scleroderma, polymyositis, dermatomyositis, psoriasis ' 
vasculitis, Wegener's granulomatosis, Crohn's disease and ulcerative colitis. Each of 
these diseases is characterized by T cell receptors (TCRs) ma. bind ,o endogenous 
antigens and initiate me inflammatory cascade associated with autoimmune diseases 
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. , , ™ C Ch "" eriC adenoviral «»» »f 1" invention are particularly wall 

suued for ,herapeu,ic regimens in which mu|tip|e ^ J" 1 

.ransgenea is desired , , g „ ,„ regimMs ^ ^ 

■n combmatton regime™ involving delivery of other transgene, Such mgimens raay 

■nvolve adrntmarauon of a chimeric adenoviral vector, followed by re-ad m i„ ia ,a t io„ 

3 *" — «W — *- P-ico-arly desirable mgimens 
■nvolve adn, nisnation of a chimeric adenovita, vector of*, invention, i„ which fte 
serotype of the viml vector delivered in the flrs , administration differs ftom the 
serotype of the vira, vector u,i,i 2 ed in one or more of the subsequent adminis^ions 
For example, a therapeutic regimen invoivesadministmtion of a chimeric vector and 
repeat aum.mstration with one or more adenoviral vectors of the same or different 
serotype, ,„ another example, a therapeudc regimen involves admini^tion of an 
adenovtral vector followed by repeat administration with a chimeric vectce of the 
mvenhon which differs from *. serotype of the firs, deliver adenoviral veetor, and 
phona ly fcrmer administration with another vector which is the same or, pmferably 
dtffers from the serotype of the vector in the prior adminislmion aeps ^ 
reg.mens are no, limted to delivery of adenoviral vectors c nsm , cttd using ^ 
chimeric sertaypes of the invention. Rather, these regimens can readily utilize 
chuneric or non-chimeric vector of other aoenovira, serotypes, which may be of 
arhfical, human or non-human primate, or other mammalian sources, in combination 
w„h one or mom of the chimeric vectors of the invention. Examples of such 
serotypes are discussed elsewhere in this d „c„ment. Further, dtese therapeutic 
regtmens may involve either simultaneous or sequential delivery of chimeric 
adenoviral vectors of the invention in combination with non-adenovira, vectors, non- 
vua, vectors, and/or a variety of Cher therapeuucaUy useful compounds or mo.ecule, 
Th presen, invention is no, limited ,„ these ,herapeu,ic regimen, a variety of which 
will be readily apparent to one of skill in the art. 

B. Ad-Mediated Delivery of Immunogenic Transgenes 
The adenoviruses of the invention may also be employed as 
■mmunogenic composition, As used herein, an immunogenic composition is a 
composition to which a humora. (e.g., antibody) or cellular (e.g., . cytotoxic T cel.) 
response ,s mounted to a transgene product delivered by the immunogenic 
composition following delivery to a mammal, and preferably a primate. The present 
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.nv.„«,o„ provides an Ad that can contain in any of its adenovirus sequence deletions 
a gene encoding a desired immunogen. Chimeric adenoviruses based on simian or 
other non-human mammalian primate serotypes are Italy „ be better suited for use 
as a hve recombinant virus vaccine in different animal species Compaq ,o an 
adenovirus of human origin, bu, ia no, hW ,o such a use. The recombinant 
adenoviruses can be used aa prophylactic or therapeutic vaccines again* any 
P«hogen for which fc antige „ (s) ^ fc ^ 

able tc .limit the apread of the pathogen has been identified and for which the cDNA is 
available. 

Such vaccinal (or other immunogenic) compositions are formulated in 
a su,table delivery vehicle, as described above. Generally, doses for the 
immunogenic compositions are in the range defined above for therapeutic 
compositions. The levels of immunity of the selected gene can be monitored to 
determine the need, if any, for booster, Following an assessment of antibody titers in 
the serum, optional booster immunizations may be desired. 

Optionally, a vaccinal composition of the invention may be formulated 
to certain other components, including, e.g. adjuvants, stabilizers, pH adjusters 
preservatives and the like. Such components are well known to those of skil. in the 
vaccme art. Examples of suitable adjuvants include, without limitation, liposomes 
alum, monophosphory. lipid A, and any biologically active factor, such as cytokine 
an mterleukin, a chemokine, a ligands, and optimally combinations thereof. Certain'of 
these biologically active facto, can be expressed in vivo, e.g., via a p.asmid or viral 
vector. For example, such an adjuvant can be administered with a priming DNA 
vaccne encoding an antigen to enhance the antigen-specific immune response 
compared with the immune response generated upon priming with a DNA vaccine 
encoding the antigen only. 

The adenoviruses are administered in "an immunogenic amount", that 
-s, an amount of adenovirus that is effective in a route of administration to transfect 
the desired cells and provide sufficient leve.s of expression of the selected gene to 
mduce an immune response. Where protective immunity is provided the 
recombinant adenoviruses are considered to be vaccine compositions useful in 
preventing infection and/or recurrent disease. 



29 



Alternatively, or in addition, the vectors of the invention may contain 
a transgene encoding a peptide, polypeptide or protein which induces an immune 
response to a se.ected immunogen. The recombinant adenoviruses of this invention 
are expected to be highly efficacious at inducing cytolytic T cells and antibodies to 
the insetted heterologous antigenic protein expressed by the vector. 

For example, immunogens may be selected from a variety of viral 
families. Example of desirable viral families against which an immune response 
would be desirable include, the picornavirus family, which includes the genera 
rhmov.ruses, which are responsible for about 50% of cases of the common cold- the 
genera enteroviruses, which include poxviruses, coxsackieviruses, echoviruses and 
human enteroviruses such as hepatitis A virus; and the genera apthoviruses, which are 
respons.b.e for foot and mouth diseases, primarily in non-human animals. Within the 
p.cornavirus family of viruses, target antigens include the VP1, VP2 VP3 VP4 and 
VPG. Another viral family includes the calcivirus family, which encompasses me 
Norwalk group of viruses, which are an important causative agent of epidemic 
gastroenteritis. Still another viral family desirable for use in targeting antigens for 
mducmg immune responses in humans and non-human animals is the togavirus 
fam.ly, which includes the genera alphavirus, which include Sindbis viruses 
RossRiver virus, and Venezuelan, Eastern & Western Equine encephalitis, and 
rubivirus, including Rubella virus. The flaviviridae family includes dengue, yellow 
fever, Japanese encephalitis, St. Ix>uis encephalitis and tick borne encephalitis 
v-ruses. Other target antigens may be generated from the Hepatitis C or the 
coronavirus family, which includes a number of non-human viruses such as infectious 
bronchitis virus (poultry), porcine transmissible gastroenteric virus (pig), porcine 
hemagglutinatin encephalomyelitis virus (pig), f elme infectious peritonitis virus 
(cats), felme enteric coronavirus (cat), canine coronavirus (dog), and human 
resp,ratory coronaviruses, which may cause the common cold and/or non-A B or C 
hepatms. In addition, the human coronaviruses include the putative causative agent 
of sudden acute respiratory syndrome (SARS). Within the coronavirus family target 
antigens include the El (also called M or matrix protein), E2 (also called S or Spike 
protein), E3 (also called HE or hemagglutin-elterose) glycoprotein (not present in all 
coronaviruses), or N (nucleocapsid). Still other antigens may be targeted against the 
rhabdovirus family, which includes the genera vesiculovirus (e.g., Vesicular 
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Stomatitis Virus), and the general lyssavirus (e.g., rabies). Within the rhabdovirus 
family, suitable antigens may be derived from the G protein or the N protein. The 
family filoviridae, which includes hemorrhagic fever viruses such as Marburg and 
Ebola virus, may be a suitable source of antigens. The paramyxovirus family 
includes parainfluenza Virus Type 1, parainfluenza. Virus Type 3, bovine 
parainfluenza Virus Type 3, rubulavirus (mumps virus), parainfluenza Virus Type 2 
parainfluenza virus Type 4, Newcastle disease virus (chickens), rinderpest, 
morbillivirus, which includes measles and canine distemper, and pneumovirus, which 
includes respiratory syncytial virus. The influenza virus is classified within the 
family orthomyxovirus and is a suitable source of antigen (e.g., the HA protein, the 
Nl protein). The bunyavirus family includes the genera bunyavirus (California 
encephalitis, La Crosse), phlebovirus (Rift Valley Fever), hantavirus (puremala is a 
hemahagin fever virus), nairovirus (Nairobi sheep disease) and various unassigned 
bungaviruses. The arenavirus family provides a source of antigens against LCM and 
Lassa fever virus. The reovirus family includes the genera reovirus, rotavirus (which 
causes acute gastroenteritis in children), orbiviruses, and cultivirus (Colorado Tick 
fever), Lebombo (humans), equine encephalosis, blue tongue. 

The retrovirus family includes the sub-family oncorivirinal which 
encompasses such human and veterinary diseases as feline leukemia virus, HTLVI 
and HTLVII, lentivirinal (which includes human immunodeficiency virus (HIV), 
simian immunodeficiency virus (SIV), feline immunodeficiency virus (F1V), equine 
infectious anemia virus, and spumavirinal). Among the lentiviruses, many suitable 
antigens have been described and can readily be selected. Examples of suitable HIV 
and SIV antigens include, without limitation the gag, pol, Vif, Vpx, VPR, Env, Tat 
Nef, and Rev proteins, as well as various fragments thereof. For example, suitable' 
fragments of the Env protein may include any of its subunits such as the g P 120, 
gpl60, gp41, or smaller fragments thereof, e.g., of at least about 8 amino acids in 
length. Similarly, fragments of the tat protein may be selected. [See, US Patent 
5,891,994 and US Patent 6,193,981.] See, also, the HIV and SIV proteins described 
in D.H. Barouch et al, J. Virol, 75(5):2462-2467 (March 2001), and R.R. Amara et 
al, Science, 292:69-74 (6 April 2001). In another example, the HIV and/or SIV ' 
immunogenic proteins or peptides may be used to form fusion proteins or other 
immunogenic molecules. See, e.g., the HIV-1 Tat and/or Nef fusion proteins and 
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WO 99/16884, pubhsned April 8, ,, 9 ,. T„ e mvendon „ „ M ^ ^ ^ 
«*r HV immunogenic ^ orpeptldes heK|n 

Pa en, 5,972,596. Further, any desired HIV and/or S,V immune may ^ 
dehveted alone „ r i„ combination . Suc „ 

single vector or from multiple vector! rwi„„.n u as'ontroma 
deliverv „f amorally, another combination may involve 

deltvety of one or moreexpress*. immunogens with delivery of one or more of the 
unmunogens i„ prolei „ fonn . Suc „ ^ ^ ^ ^ ^ 

(BKU and TP, , ^ T"*" ^ i " C "' deS SUb - fami " i«*<— *— 
(BKU and ,CU vtruses, and the sub-femily papinontaviros (associated with capers 

AD7 ARD, O B.) wh.c cause respir^ diseese and/or enteritis. The points 
mcludes femtly feline parvovirus (feline enteritis,, feline penlcucopenJL, cani e 
parvov,^ and porcine parvovirus. The heipesvirus fam „ y ^ fc ^ " 
alphahetpesvtrinae, which encompasses the genera simplexvirus (HSV1 HSVH) 
vancenovirua (pseudotabes, varicella zoster, and the sub-family b^p-JL 
whtch ,„ c ,udes the genera cyfcmegalovirus (HCMV, muromegelovirtZd me 
su - a™, g ammah irime , which includes the ge™ lymphocytic, EBV 
(Burktrts ly mphoma)> mfectious mi „ otrachei , is Marek , s ^ 

rhedinoviru, The poxvirus family includes the sub-family chortopoxvirinae, which 
encompasses the genera orthopoxvirus (Varioia (Smailpox) and Vaccinia (Cowpox)) 
parapoxvt^s, avipoxvta, capripoxviru., leponpoxvirua, suipoxvirus, end the 
sub-famtly entomopoxvirinae. The hepadnavirus family includes the Hepatitis B 
vtrus. One unclassified virus which may be suitable soutce of antigens is the 
Hepatitis delta vinus. Stil, other vim, sources may include avian infectious bursal 
disease v„us and porcine respirato* and reproductive syndrome virus. The 
elphavtrua famiiy includes equine arteritis virus end various Encephaiitia viruses 
The vtruses of the present invention m a y a , S o ceny immunogens 
whtch are usefid to immunize a humen ornon-hum a „ 
mdudtng bacerta, fcngi, ^ raicroorganiOTS „ ^ 
mfec, human and non-humen vertebrates, or fmm a cancer cell or tumor eel, 
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Examples of bacterial pathogens include pathogenic gram-positive cocci include 
pneumococci; staphylococci; and streptococci. Pathogenic gram-negative cocci 
mclude meningococcus; gonococcus. Pathogenic enteric gram-negative bacil.i 
mclude enterobacteriaceae; pseudomonas, acinetobacteria and eikenella; melioidosis- 
salmonella; shige.la; haemophi.us; moraxella; H. ducreyi (which causes chancroid)- ' 
brucella; Franisella tularensis (which causes tularemia); yersinia (pasteure.la)- 
streptobacillus moniliformis and spirillum; Gram-positive bacilli include listeria 
monocytogenes; erysipelothrix rhusiopathiae; Corynebacterium diphtheria 
(d.phtheria); cholera; B. anthracis (anthrax); donovanosis (granuloma inguinale); and 
bartonel.osis. Diseases caused by pathogenic anaerobic bacteria include tetanus- 
botulism; other Clostridia; tuberculosis; leprosy; and other mycobacteria. Pathogenic 
sp.rochetal diseases include syphilis; treponemal: yaws, pint* and endemic 
syph.hs; and leptospirosis. Other infections caused by higher pathogen bacteria and 
pathogenic fungi include actinomycosis; nocardiosis; cryptococcosis, blastomycosis 
h-stoplasmosis and coccidioidomycosis; candidiasis, aspergillosis, and mucormycosis- 
sporotnchosis; paracoccidiodomycosis, petrie.lidiosis, torulosis, mycetoma and 
chromomycosis; and dermatophytosis. Rickettsial infections include Typhus fever 
Rocky Mountain spotted fever, Q fever, and Rickettsialpox. Examples of 
mycoplasma and chlamydia, infections include: mycoplasma pneumoniae- 
lymphogranuloma venereum; psittacosis; and perinatal chlamydial infections 
Pathogenic eukaryotes encompass pathogenic protozoans and helminths and 
mfections produced thereby include: amebiasis; malaria; leishmaniasis- 
trypanosomiasis; toxoplasmosis; Pneumocystis carinii; Trichns; Toxoplasma gondii- 
babesiosis; giardiasis; trichinosis; fi.ariasis; schistosomiasis; nematodes; trematodes ' 
^ or flukes; and cestode (tapeworm) infections. 

Many of these organisms and/or toxins produced thereby have been 
•denffied by the Centers for Disease Control [(CDC), Department of Heath and 
Human Services, USA], as agents which have potential for use in biological attacks 
For example, some of these biological agents, include, Bacillus anthracis (anthrax) 
Clostridium W/num and its toxin (botulism), Yersinia pestis (plague), variola major 
(smallpox), Francisella tularensis (tularemia), and viral hemorrhagic fevers 
[filoviruses (e.g., Ebola, Marburg], and arenaviruses [e.g., Lassa, Machupo]), all of 
wh,ch are currently classified as Category A agents; Coxiella burnetii (Q fever)- 
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Brucella species (brucellosis), «*, (glands), ***** 

t* (n,el0idMiSX — ~ * «* toxin), CW, UM 

M. and i,s toxin (eps „ on toxin) , Staphy|ococciis ^ ^ ^ *" 

c*oW C^w^ W , Typhus fever ( ^„ Ijopo Md 
encep a „„s (alphavinrses, e g „ Vene2 „ elan ^ 

encepnalths; western aouina encephalitis); .„ of which „ c|aKif . ed 

c enB; " < Nipan virus md han ** viruses ' which - "~* 

as Ca^ry C agan,, ,„ addition, other organisms> wnicn are ^ 

C 'dT ed ; ^ ide " ,ifled ^ " <n *. fihure. 

. w,ll be readdy „nders,ood ,na, «he vW vccors and other cornea described 
ere, a re ^ t0 oellver ^ ^ ^ ^ ^ 

ofter by-products, which wil, peeven, and/or hea, infecion or orber adverse reactions 
with these biological agents. «<-iions 

Adrainiara,i< »"°'''l'"^of Ul einve„,ion K >deliveri mmun oge„ s 
=ga,ns, «,e variahle region of dre T cells elicit an immmt response including Z " 
.hose T cell, ,„ RA, severa, specific variable regions of TCR$ _ 
volved.n the dtsease have been cWeriacd. These TCRs include V-3 V-14 

If' I 7 and 7 T S ' of a " ucteic acid ~ ,h * - 

°^r^ ,tei ' aniw ^ fe, ™ i, ^ T « , ^"vo.v« J 

•n RA. In MS, severa! stifle variable region of TC Rs which an, i„v ol ved in the 
disease ave been characterized. These TCRs inctade V-7 and Vu-,0. Thus, delivery 
of a nucietc acid seouence that encodes a, leas, one of these ^peptides wil, eliei, I 
.nrmune response fc, will ^ T ^ ^ ^ Ms ^ ^ 

spec.fic variable regions of TCRs which are involved in the disease have been 
chanrcteri^d. These TCRs include V-o, V.g, V-,4 and Va-16. Va-3C, Va-7, Va-,4 
Va- 15 ,V„. l 6,Vu- 2 ga„dV„- I2 . Thus, deliver of a chinter, adenovirus , hi, 
encodes a, leas, one of Utese p„,ypep ti des wil, elici, an immune response _„ 
target T cells involved in scleroderma. 

C. Ad-Mediated Delivery Methods 

The therapeutic levels, or leve.s of immunity, of the selected gene can 

be mon.tored to determine the need, if any for boo^rc p^.i • 

rnj ,. T „ ' ny ' tor booster s. Following an assessment of 

CD8 + T cell response, or optionally, antibody titers, in the serum, optional booster 
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immunizations may be desired. Optionally, the adenoviral vectors of the invention 
may be delivered in a single administration or in various combination regimens, e.g., 
in combination with a regimen or course of treatment involving other active 
ingredients or in a prime-boost regimen. A variety of such regimens have been 
described in the art and may be readily selected. 

For example, prime-boost regimens may involve the administration of 
a DNA (e.g., plasmid) based vector to prime the immune system to a second or 
further, booster, administration with a traditional antigen, such as a protein or a 
recombinant virus carrying the sequences encoding such an antigen. See, e.g., WO 
00/1 1 140, published March 2, 2000, incorporated by reference. Alternatively,' an 
immunization regimen may involve the administration of a chimeric adenoviral vector 
of the invention to boost the immune response to a vector (either viral or DNA-based) 
carrying an antigen, or a protein. In still another alternative, an immunization 
regimen involves administration of a protein followed by booster with a vector 
encoding the antigen. 

In one embodiment, the invention provides a method of priming and 
boosting an immune response to a selected antigen by delivering a plasmid DNA 
vector carrying said antigen, followed by boosting with an adenoviral vector of the 
invention. In one embodiment, the prime-boost regimen involves the expression of 
multiproteins from the prime and/or the boost vehicle. See, e.g., R.R. Amara, 
Science, 292:69-74 (6 April 2001) which describes a multiprotein regimen for 
expression of protein subunits useful for generating an immune response against HIV 
and SIV. For example, a DNA prime may deliver the Gag, Pol, Vif, VPX and Vpr 
and Env, Tat, and Rev from a single transcript. Alternatively, the SIV Gag, Pol and 
HIV-1 Env is delivered in a recombinant adenovirus construct of the invention. Still 
other regimens are described in WO 99/16884 and WO 01/54719. 

However, the prime-boost regimens are not limited to immunization 
for HIV or to delivery of these antigens. For example, priming may involve 
delivering with a first vector of the invention followed by boosting with a second 
vector, or with a composition containing the antigen itself in protein form. In one 
example, the prime-boost regimen can provide a protective immune response to the 
virus, bacteria or other organism from which the antigen is derived. In another 
desired embodiment, the prime-boost regimen provides a therapeutic effect that can 
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be measured using convention assays for detection of the presence of the condition 
for which therapy is being administered. 

The priming composition may be administered at various sites in the 
body in a dose dependent manner, which depends on the antigen to which the desired 
.mmune response is being targeted. The invention is not limited to the amount or 
■do. of injection(s) or to the pharmaceutical carrier. Rather, the regimen may 
■nvolve a priming and/or boosting step, each of which may include a single dose or 
dosage that is administered hourly, daily, weekly or monthly, or yearly. As an 
example, the mammals may receive one or two doses containing between about 1 0 ug 
to about 50 ug of plasmid in carrier. A desirable amount of a DNA composition 
ranges between about 1 ug to about 10,000 ug of the DNA vector. Dosages may vary 
from about 1 Mg t0 1000 ug DNA per kg of subject body weight. The amount or site 
of delivery is desirably selected based upon the identity and condition of the mamma. 
The dosage unit of the vector suitable for delivery of the antigen to the mammal is 
descnbed herein. The vector is prepared for administration by being suspended or 
d,ssolved in a pharmaceutical^ or physiologically acceptable carrier such as isotonic 
saline; isotonic salts solution or other formulations that will be apparent to those 
skilled in such administration. The appropriate carrier will be evident to those skilled 
m the art and will depend in large part upon the route of administration. The 
compositions of the invention may be administered to a mammal according to the 
routes described above, in a sustained release formulation using a biodegradable 
biocompatible polymer, or by on-site delivery using micelles, gels and liposomes 
Optionally, the priming step of this invention also includes administering with the 
priming composition, a suitable amount of an adjuvant, such as are defined herein. 

Preferably, a boosting composition is administered about 2 to about 
27 weeks after administering the priming composition to the mammalian subject The 
administration of the boosting composition is accomplished using an effective amount 
of a boosting composition containing or capable of delivering the same antigen as 
administered by the priming DNA vaccine. The boosting composition may be 
composed of a recombinant viral vector derived from the same viral source (e g 
adenoviral sequences of the invention) or from another source. Alternatively the 
"boosting composition" can be a composition containing the same antigen as encoded 
in the priming DNA vaccine, but in the form of a protein or peptide, which 
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composition induces an immune response in the host. In another embodiment the 
boosting composition contains a DNA sequence encoding the antigen under the 
control of a regulatory sequence directing its expression in a mammalian cell e g 
vectors such as well-known bacterial or viral vectors. The primary requirements of 
the boosting composition are that the antigen of the composition is the same antigen 
or a cross-reactive antigen, as that encoded by the priming composition. 

In another embodiment, the adenoviral vectors of the invention are 
also well suited for use in a variety of other immunization and therapeutic regimens 
Such regimens may involve delivery of adenoviral vectors of the invention 
simultaneously or sequentially with Ad vectors of different serotype capsids, 
regimens in which adenoviral vectors of the invention are delivered simultaneously or 
sequentially with non-Ad vectors, regimens in which the adenoviral vectors of the 
invention are delivered simultaneously or sequentially with proteins, peptides, and/or 
other b,ologically useful therapeutic or immunogenic compounds. Such uses will be 
readily apparent to one of skill in the art. 

IV. Simian Adenovirus 1 8 Sequences 

The invention provides nucleic acid sequences and amino acid sequences of 
Ad SAI8, which are isolated from the other viral material with which they are 
associated in nature. These sequences are useful in preparing heterologous molecules 
containing the nucleic acid sequences and amino acid sequences, and regions or 
fragments thereof as are described herein, viral vectors which are useful for a variety 
of purposes, including the constructs and compositions, and such methods as are 
described herein for the chimeric adenoviruses, including, e.g., in host cells for 
production of viruses requiring adenoviral helper functions, as delivery vehicles for 
heterologous molecules such as those described herein. These sequences are also 
useful in generating the chimeric adenoviruses of the invention. 

A. Nucleic Acid Sequences 

The SA18 nucleic acid sequences of the invention include nucleotides 
SEQ ID NO: 12, nt 1 to 31967. See, Sequence Listing, which is incorporated by 
reference herein. The nucleic acid sequences of the invention further encompass the 
strand which is complementary to the sequences of SEQ ID NO: 12, as well as the 
RNA and cDNA sequences corresponding to the sequences of these sequences figures 
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and .her complement strands. Further included in mis invention are nucleic acid 
sequences which are graater than 95 to 98%, and more pmferably abou. 99 .o 99 9% 
homologous or identical to the Sequence Listing. Also induded in the nucleic acid 
sequences of*, invention are natura, variants and engineered modifications of me 
S eque„cesp ro videdi„SEQ.DNO:,2and.heircomp te men^ slra „ d , Such 
modifications include, for example, iabels ma, are Wn in the an, methvlation. and 
subs„.u„o„ of one or more of the naturally occurring uuc.eo.ides with , * 
nucleotide. 

The invention further encompasses fragments of the sequences of 
SA18, the,r complementary strand, cDNA and RNA complementary thereto. Suitable 
fragments are at least ,5 nuc.eotides in length, and encompass functiona. fragments 
■•e., fragments which are of biological interest. For example, a functiona. fragment ' 
can express a desired adenoviral product or may be useful in production of 
recombinant vira. vectors. Such fragments include the gene sequences and fragments 
listed in the tables below. 

The following tables provide the transcript regions and open reading 
frames m the simian adenovirus sequences of the invention. For certain genes the 
transcripts and open reading frames (ORFs) are located on the strand complementary 
tothatpresentedinSEQlDNO:,2. See, e.g., E2b, E4 and E2a. The calculated 
molecular weights of the encoded proteins are also shown. 
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The SA18 adenoviral nucleic acid sequences are useful as therapeut. 
and immunogenic agents and in construction of a variety of vector systems and host 
cells. Such vectors are useful for any of the purposes described above for the 
chimeric adenovirus. Additionally, these SA18 sequences and products may be used 
alone or in combination with other adenoviral sequences or fragments, or in 
combination with elements from other adenoviral or non-adenoviral sequences The 
adenov,ral sequences of the invention are also useful as antisense delivery vectors 
gene therapy vectors, or vaccine vectors, and in methods of using same. Thus the' . 
mvenuon further provides nucleic acid molecules, gene delivery vectors, and host 
cells which contain the Ad sequences of the invention. 

For example, the invention encompasses a nucleic acid molecule 
containing simian Ad ITR sequences of the invention. In another example, the 
•nvention provides a nucleic acid molecule containing simian Ad sequences of the 
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invention encoding a desired Ad gene product. Still other nucleic acid molecule 
constructed using the sequences of the invention will be readily apparent to one of 
skill in the art, in view of the information provided herein. 

In one embodiment, the simian Ad gene regions identified herein may 
be used in a variety of vectors for delivery of a heterologous molecule to a cell 
Examples of such molecules and methods of delivery are provided in Section III 
herein. For example, vectors are generated for expression of an adenoviral capsid 
protein (or fragment thereof) for purposes of generating a viral vector in a packaging 
host cell. Such vectors may be designed for expression in trans. Alternatively such 
vectors are designed to provide cells which stably contain sequences which express 
des,red adenoviral functions, e.g., one or more of Ela, Elb, the terminal repeat 
sequences, E2a, E2b, E4, E40RF6 region. 

In addition, the adenoviral gene sequences and fragments thereof are 
useful for providing the helper functions necessary for production of helper- 
dependent viruses (e.g., adenoviral vectors deleted of essential functions or adeno- 
associated viruses (AAV)). For such production methods, the simian adenoviral 
sequences of the invention are utilized in such a method in a manner similar to those 
described for the human Ad. However, due to the differences in sequences between 
the suman adenoviral sequences of the invention and those of human Ad, the use of 
the sequences of the invention essentially eliminate the possibility of homologous 
recombination with helper functions in a host cell carrying human Ad El functions 
e.g., 293 cells, which may produce infectious adenoviral contaminants during rAAV 
production. 

Methods of producing rAAV using adenoviral helper functions have 
been described at length in the literature with human adenoviral serotypes. See e g 
US Patent 6,258,595 and the references cited therein. See, also, US Patent 5 871 982- 
WO 99/14354; WO 99/15685; WO 99/47691. These methods may also be used in ' 
products of non-human serotype AAV, including non-human primate AAV 
serotypes. The simian adenoviral gene sequences of the invention which provide the 
necessary helper functions (e.g., Ela, Elb, E2a and/or E4 ORF6) can be particularly 
useful in providing the necessary adenoviral function while minimizing or eliminating 
the pos S ,b,lity of recombination with any other adenoviruses present in the rAAV- 
packaging cell which are typically of human origin. Thus, selected genes or open 
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reading ftan.es of the adenovira, s« of*. invent may be utilized i„ „ rae 
rAAV production methods. 

Alttr " atiTC| y.^"tbi„a„,aderK,v i r.Ui mianvec , orsof , heinvent 
may e „„„«d in tKeae memods. Such mcombinan, adenoviral simian vectors may 
■nclude, e g., a hybrid simian Ad/AAV in which simian Ad sequences flank a rAAV 
express™ c^e-e composed of, eg.. AAV and/or 5' ,TRs and a transgene under 
•he control of regulatory sequences which control irs expression. One of skill |„ me 
en w„, recognize that still other ^ ^ ^ ^ 

the mvenrton win be uscm, forproducion of rAAV and other viruses dependen, up0 „ 
adenoviral helper. F 

In still another embodiment, nucleic acid molecules are designed for 
dehvery and expression of severed adenoviral gene p^ducts in a host cell * achieve 
a des,red physiologic effect. For example, a nucleic aeid molecule confining 
sequences encoding an adenovirus EL protein of the invention may be de.ivemd to a 
subject for use as a cancer therapeutic. Optionally, such a molecule is foliated in a 
hptd-based earner am! preferentially ^ cancer cells. Such a formulation may be 
combated with other cancer rherapeutics (e.g., cisplatin, tao ,, or ^ like , Sti „ ^ 
uses for me adenoviral sequences provided herein will be readily appa™, to one of 
skill m the art. 

In addition, one of skill in the art will readily understand that the Ad 
sequences of the invention can be readily adapted for use for a variety of viral and 
non-vira. vector syste ms for in vitro, ex vivo or in vivo delivery of therapeutic and 
■rnmunogenic molecules, including any of those identified as being deliverable via the 
chunenc adenoviruses of the invention. For example, the simian Ad genome of the 
■nvent.on can be utilized in a variety of rAd and non-rAd vector system, Such 
vectors systems may include, e.g., plasmids, lentiviruses, retroviruses, poxviruses 
vaccinia viruses, and adeno-associated viral systems, among others. Selection of these 
vector systems is not a limitation of the present invention. 

The invention further provides molecules useful for production of the 
s.m,an and simian-derived proteins of the invention. Such molecules which carry 
polynucleotides including the simian Ad DNA sequences of the invention can be in 
the form of a vector. 
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B. Simian Adenoviral Proteins of the Invention 

The invention further provides gene products of the above 
adenoviruses, such as proteins, enzymes, and fragments thereof, which are encoded 
by the adenovira. nucleic acids of the invention. The invention further encompasses 
SA18 proteins, enzymes, and fragments thereof, having the amino acid sequences 
encoded by these nucleic acid sequences which are generated by other methods. Such 
protons indude those encoded by the open reading frames identified in the tables 
above, and fragments thereof. 

Thus, in one aspect, the invention provides unique simian adenoviral 
protons which are substantially pure, i.e., are free of other viral and proteinaceous 
protons. Preferably, these proteins are at .east 10% homogeneous, more preferably 
60% homogeneous, and most preferably 95% homogeneous. 

In one embodiment, the invention provides unique simian-derived 
capsid proteins. As used herein, a simian-derived capsid protein includes any 
adenoviral capsid protein that contains a SA1 8 capsid protein or a fragment thereof 
as defined above, including, without limitation, chimeric capsid proteins, fusion 
protons, artificial capsid proteins, synthetic capsid proteins, and recombinant^ capsid 
protons, without limitation to means of generating these proteins. 

Suitably, these simian-derived capsid proteins contain one or more 
SA 1 8 regions or fragments thereof (e.g., a hexon, penton, fiber or fragment thereof) in 
combination with capsid regions or fragments thereof of different adenoviral 
serotypes, or modified simian capsid proteins or fragments, as described herein A 
"modification of a capsid protein associated with altered tropism" as used herein 
mcludes an altered capsid protein, i.e, a penton, hexon or fiber protein region or 
fragment thereof, such as the knob domain of the fiber region, or a polynucleotide 
encoding same, such that specificity is altered. The simian-derived capsid may be 
constructed with one or more of the simian Ad of the invention or another Ad 
serotypes which may be of human or non-human origin. Such Ad may be obtained 
from a variety of sourees including the ATCC, commercial and academic sources or 
the sequences of the Ad may be obtained from GenBank or other suitable sources! 

The amino acid sequences of the simian adenoviruses penton proteins 
of the mvention are provided herein. The AdSA18 penton protein is provided in SEQ 
ID NO: 13. Suitably, any of these penton proteins, or unique fragments thereof, may 

43 



be uuhzed for a variety of purposes. Examples of suitable fragments include the 
penton having N-termina. and/or C-termina. truncations of about 50, 100, 150 or 200 
ammo acids, based upon the amino acid numbering provided above. Other suitable 
fragments include shorter internal, C-termina!, or N-termina! fragments. Further the 
penton protein may be modified for a variety of purposes known to those of skil.in 
the art. . 

The invention further provides the amino acid sequences of the hexon 
proton of SA18, SEQ ID NO:14. Suitably, this hexon protein, or unique fragments 
thereof may be utilized for a variety of purpose, Examples of suitable fragments 
•nclude the hexon having N-termina. and/or C-termina. truncations of about 50 100 
150, 200, 300, 400, or 500 amino acids, based upon the amino acid numbering ' ' 
provided above and in SEQ ID NO: 14. Other suitable fragments include shorter 
internal, C-terminal, or N-termina. fragments. For example, one suitable fragment the 
loop region (domain) of the hexon protein, designated DEI and FG1 or a 
hypervariable region thereof. Such fragments include the regions spanning amino 
ac,d residues about 125 to 443; about 138 to 441, or smaller fragments, such as those 
spannmg about residue 138 to residue 163; about 170 to about 176; about 195 to 
about 203; about 233 to about 246; about 253 to about 264; about 287 to about 297- 
about 404 to about 430, about 430 to 550, about 545 to 650; of the simian hexon ' 
proteins, with reference to SEQ ID NO: 14. Other suitable fronts may be readily 
•dentified by one of skill in the art. Further, the hexon protein may be modified for a 
vanety of purposes known to those of skill in the art. Because the hexon protein is 
the determinant for serotype of an adenovirus, such artificial hexon proteins would 
result m adenoviruses having artificial serotypes. Other artificial capsid proteins can 
also be constructed using the chimp Ad penton sequences and/or fiber sequences of 
the invention and/or fragments thereof. 

In one example, it may be desirable to generate an adenovirus having 
an altered hexon protein utilizing the sequences of a hexon protein of the invention 
One suitable method for altering hexon proteins is described in US Patent 5 922 315 
wh,ch ,s incorporated by reference. In this method, at least one loop region of the ' 
adenovirus hexon is changed with at least one loop region of another adenovirus 
serotype. Thus, at least one loop region of such an altered adenovirus hexon protein 
■s a sunian Ad hexon loop region of the invention. In one embodiment, a loop region 
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of the SA18 hexon protein is replaced by a loop region from another adenovirus 
serotype. In another embodiment, the loop region of the SA18 hexon is used to 
replace a loop region from another adenovirus serotype. Suitable adenovirus 
serotypes may be readily selected from among human and non-human serotypes as 
described herein. SA18 is selected for purposes of illustration only; the other simian 
Ad hexon proteins of the invention may be similarly altered, or used to alter another 
Ad hexon. The selection of a suitable serotype is not a limitation of the present 
mvention. Still other uses for the hexon protein sequences of the invention will be 
readily apparent to those of skill in the art. 

The invention further encompasses the fiber proteins of the simian 
adenoviruses of the invention. The fiber protein of AdSAl 8 has the amino acid 
sequence of SEQ ID NO: 15. Suitably, this fiber protein, or unique fragments thereof 
may be utilized for a variety of purposes. One suitable fragment is the fiber knob 
wh.ch spans about amino acids 247 to 425 of SEQ ID NO: 15. Examples of other 
suitable fragments include the fiber having N-termina. and/or C-termina! truncations 
of about 50, 100, 150, or 200 amino acids, based upon the amino acid numbering 
provided above and in SEQ ID NO: 1 5. Still other suitable fragments include internal 
fragments. Further, the fiber protein may be modified using a variety of techniques 
known to those of skill in the art. 

The invention further encompasses unique fragments of the proteins 
of the invention which are at least 8 amino acids in length. However, fragments of 
other desired lengths can be readily utilized. In addition, the invention encompasses 
such mod.fications as may be introduced to enhance yield and/or expression of an 
SA18 gene product, e.g., construction of a fusion molecule in which all or a fragment 
of the SA18 gene product is fused (either directly or via a linker) with a fusion 
partner to enhance. Other suitable modifications include, without limitation, 
truncation of a coding region (e.g., a protein or enzyme) to eliminate a pre- or pro- 
protein ordinarily cleaved and to provide the mature protein or enzyme and/or 
mutation of a coding region to provide a secretable gene product. Still other 
modifications will be readily apparent to one of skill in the art. The invention further 
encompasses proteins having at least about 95% to 99% identity to the SA1 8 proteins 
provided herein. 



45 



As described herein, vectors of the invention containing the 
adenoviral capsid proteins of the invention are particularly well suited for use in 
applications in which the neutralizing antibodies diminish the effectiveness of other 
Ad serotype based vectors, as well as other viral vectors. The rAd vectors of the 
invention are particularly advantageous in readministration for repeat gene therapy or 
for boostmg immune response (vaccine titers). Examples of such regimens are 
provided herein. 

Under certain circumstances, it may be desirable to use one or more of 
the SA18 gene products (e.g., a capsid protein or a fragment thereof) to generate an 
antibody. The term "an antibody," as used herein, refers to an immunoglobulin 
molecule which is able to specifically bind to an epitope. Thus, the antibodies of the 
invention bind, preferably specifically and without cross-reactivity, to a SA1 8 
epitope. The antibodies in the present invention exist in a variety of forms including 
for example, high affinity polyclonal antibodies, monoclonal antibodies, synthetic ' 
anybodies, chimeric antibodies, recombinant antibodies and humanized antibodies 
Such ant.bodies originate from immunoglobulin classes IgG, IgM, IgA, IgD and IgE. 

Such antibodies may be generated using any of a number of methods 
know in the art. Suitable antibodies may be generated by well-known conventional 
techniques, e.g. Koh.er and Milstein and the many known modifications thereof 
S.m.Iarly desirable high titer antibodies are generated by applying known 
recombinant techniques to the monoclonal or polyclonal antibodies developed to 
these antigens [see, e.g., PCT Patent Application No. PCT/GB85/00392; British 
Patent Application Publication No. GB2 1 88638A; Amit et al. , 1 986 Science 
233:747-753; Queen et al., 1989 Proc. Natl Acad. Sci. USA, 5* 10029-1 0033- PCT 
Patent Application No. PCT/WO9007861 ; and Riechmann et al., Nature 332 323. 
327 (1988); Huse et al, 1988a Science, 2*5:1275-1281]. Alternatively, antibodies can 
be produced by manipulating the complementarity determining regions of animal or 
human antibodies to the antigen of this invention. See, e.g., E. Mark and Padlin 
"Humamzation of Monoclonal Antibodies", Chapter 4, The Handbook of 
Experimental Pharmacology, Vol. 113, The Pharmacology of Monoclonal Antibodies 
S P ringer-Verlag(June, ,994); Harlow et al, 1999, Using Antibodies: A Laboratory ' 
Manual, Cold Spring Harbor Laboratory Press, NY; Harlow etal., 1989, Antibodies- 
A Laboratory Manual, Cold Spring Harbor, New York; Houston et al, 1988 Proc 
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Natl. Acad. Sci. USA *5:5879-5883; and Bird et ai, 1988, Science 242:423-426 
Further provided by the present invention are anti-idiotype antibodies (Ab2) and anti- 
anti-idiotype antibodies (Ab3). See, e.g., M. Wettendorff et a!., "Modulation of anti- 
tumor immunity by anti-idiotypic antibodies." In Idiotypic Network and Diseases ed 
by J. Cerny and J. Hiernaux, 1990 J. Am. Sac. Microbiol., Washington DC: PP 203- ' 
229]. These anti-idiotype and anti-anti-idiotype antibodies are produced using 
techniques well known to those of skill i n the art. These antibodies may be used for a 
vanety of purposes, including diagnostic and clinical methods and kits. 

Under certain circumstances, it may be desirable to introduce a 
detectable label or a tag onto a SA18 gene product, antibody or other construct of the 
■nvenfon. As used herein, a detectable label is a molecule which is capable, alone or 
upon mteraction with another molecule, of providing a detectable signal Most 
des.rably, the label is detectable visually, e.g. by fluorescence, for ready use in 
immunohistochemical analyses or immunofluorescent microscopy. For example 
suitable labels include fluorescein isothiocyanate (FITC), phycoerythrin (PE) 
allophycocyanin (APC), coriphosphine-O (CPO) or tandem dyes, PE-cyanin-5 (PCS) 
and PE-Texas Red (ECD). All of these fluorescent dyes are commercially available ' 
and the.r uses known to the art. Other useftil labels include a colloidal gold label 
St,ll other useful labels include radioactive compounds or elements. Additionally 
labels include a variety of enzyme systems that operate to reveal a co.orimetric signal 
in an assay, e.g., glll cose oxidase (which uses glucose as a substrate) releases 
peroxide as a product which in the presence of peroxidase and a hydrogen donor such 
as tetramethyl benzidine (TMB) produces an oxidized TUB that is seen as a blue 
color. Other examples include horseradish peroxidase (HRP) or alkaline phosphatase 
(AP), and hexokinase in conjunction with glucose-6-phosphate dehydrogenase which 
reacts with ATP, glucose, and NAD+ to yield, among other products, NADH that is 
detected as increased absorbance at 340 nm wavelength. 

Other label systems that are utilized in the methods of this invention 
are detectable by other means, e.g., colored latex microparticles [Bangs Laboratories 
Ind.ana] in which a dye is embedded are used in place of enzymes to form conjugates 
w.th the target sequences provide a visual signal indicative of the presence of the 
resulting complex in applicable assays. 
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are ml. y conventional and toown t0 ^ of s|d|| „ ^ ^ ^ mcihods 

label anachmen, a re described [see, for example. Handbook of Fluorescen. probes 
and Research Chemicals, 6th Ed., R. p. M. Haugland, Molecular Probes ton 
Eugene, OR, ,9,6; Pierce Catotog and Handbook, Life Science and Analytical 
Research Products, Pierce Chemical Company, Rockford, IL, ,994/1995] Thus 
selection of ,he label and coupling metoods do no, limit .his invention. ' * 

The sequences, proteins, and hagmenta of the invention may be 
produced by any suitable means, including recombinan, p roduclion , chemica| 
synthesis .oromersyntheticmeans. Suiuble predion Uniques are we „ ^ 
to those of stall in .heart. See, e.g., Sambreok e, al, Molecular Cloning- A 
Laboranny Manual, Cold Spring Harbor Press (Cold Spring Harbor NY) 
Alternatively, peptide, can also be syntoesized by toe well known solid phase peptide 
synthea* metoods (Menifield, J. An, Ckm. Sac ., SS:2149 (m2); ^ J 
Young, Solid Phase Peptide Synthesis (Freeman, Sa„ Francisco, 1969, pp 27-62) 
These and otoer suitoble production metoods are witoin toe knowledge of .hose of 
sk,ll ,„ toe an and are no. a limitation „f , h e presenl invention. 

In addition, one of skill in toe art will readily understand *„ toe Ad 
sequences of toe invention can be readily adapted for use for a varied of viral and 
non-viral vector systems for to vtoo, a, wVo or to v,vo deliver of toerapeutic and 
■"■monogenic molecules. For example, in one en.bodin.en,, toe s.mian Ad capsid 
proteins and otoer simian adenovirus pretoins described herein are used for non-viral 
pretoin^aeddeliveryof genes, P«.ina, and otoer deaireWe diagnostic, to OT pe Utic ' 
and immunogenic moiecules. In one such embodiment a protoin of toe invention is 
hnked, d,rec,ly or indirectiy, u, a molecule for targeting te cells wi,h a receptor for 
adenov.mses. Preferably, a capsid prorein such as a hexon, pemon, fib er o, a £hgmeM 
thereof havmg a ligand for a cell surface receptor is seiectod for such targeting 
Suitable molecules for deliver are aelectod torn among toe toetapeuuc molecules 
deacrtoed herein and toeir gene products. A variety of linkers including lipids 
polyLys, and toe like may be utilized as .inkers. For example, toe simian pemon 
protein may be readily utibzed for such , purpose by production of a fusion p rol ei„ 
us,ng toe simian pemon sequences in a manner analogous to ton. described in 
Meuina-Kauwe LK, « ... Gene Ther. 2001 May; 8 (,0):795. 8 03 and Medina-Kauwe 
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6t ° ene ^ 2001 Dec > 8 < 23 > : '753-1761. Alternatively, the amino acid 
sequences of simian Ad protein IX may be utilized for targeting vectors to a cel. 
surface receptor, as described in US Patent App.n 20010047081. Suitable ligands 
include a CD40 antigen, an RGD-containing or po.y.ysine-containing sequence, and 
the hke. Still other simian Ad proteins, including, e.g., the hexon protein and/or the 
fiber protein, may be used for used for these and similar purposes. 

Still other adenoviral proteins of the invention may be used as alone 
or ,n combination with other adenoviral protein, for a variety of purposes which wili 
be readily apparent to one of skill in the art. In addition, still other uses for the 
adenoviral proteins of the invention will be readily apparent to one of ski,, in the art. 

The compositions of this invention include vectors that deliver a 
heterologous molecule to cells, either for therapeutic or vaccine purposes Such 
vectors, containing simian adenovirus DNA of SA1 8 and a minigene, can be 
constructed using techniques such as those described herein for the chimeric 
adenoviruses and such techniques as are known in the art. Alternatively, SA19 may 
be a source for sequences of the chimeric adenoviruses are described herein. 

The following examples illustrate construction and use of several 
chimeric viruses, including Pan5/Cl, hu5/Pan7 and hu5/SV25, and Pan6/Pan7 
However, these chimera are illustrative only and are not intended to limit the 
invention to those illustrated embodiments. 

Example 1 - Construction of Pan5/C1 Chimeric Simian Viruses 

Five different adenoviruses initially isolated from the chimpanzee, AdC68 
[US Patent 6,083,716], AdPanS, AdPan7, AdPan6 and AdCl [US Patent 6,083 716] 
have been sequenced. See, International Application No. PCT/US02/33645 filed 
November 2002 for the sequences of Pan5 [SEQ ID NO:l], Pan7 [SEQ ID NO 3] 
andPan6[SEQIDNO:2]. This application also provides sequent for SV1 SV25 
and SV39 [SEQ ID No. 4, 5, 6, respectively]. Sequence comparison of the capsid 
prote.n sequences predicted that AdCl clearly belonged to a different serological 
subgroup than the other four chimpanzee derived adenoviruses. 

However, attempts to cultivate AdCl in HEK293 cells revealed it to be 
fastidious in its growth characteristics (data not shown) and therefore possibly 
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unstable for use as . vecter using *. cunenfly available El complementing M „ 
1-nes. However, because of, he obvious sequence dissimilar!* of AdCl capsid 
protein se^ce from the „,„„ chimpanKe ^ ^ fe „ e|| ^ fc 
huA 5) chimeric adenovirus vectors were generated wi.h «be capsid characteristics of 
AdCl . fa v,ew of the above-mentioned drawbacks associated with only making 
hexon changes, more extensive replacement, were mad e in d, e chimera described 
herem, i.e., construction of chitneras where the replacemen, wen. beyond jus, ,he 
hexon, ,o achieve Pvo goal, The firs, was ,o determine whether making extended 
replacement would allow for ,he rescue of viruses conteining hexons of unrelated 
serous fct may no, o,herwise he amenable te rascue. The second goal was ,o test 
whether the grow* characterises of adenovirus ve«o re such as AdPanS, ma, have 
been found in our laboratory ,o be able ,o be grown ,„ high .her for the purpose of 
manufacture, would also be presen, in ,he chimeric vims, particularly when me hexon 
(and other capsid proteins, a re derived fern a virus such as AdCl ma, a re difficul, ,„ 
grow ,o a high yield in eel, lines such as HEK293. An added bonus of extending ,he 
placemen, ,„ indude me fiber protein would be ,o former i„c re ase me antigenic 
d,ss,mila,ity to beyond d,a, afforded by a hexon change alone. 

As an alternative to oblaining purified virus as source for adenoviral DNA ,„ 
sequence, we have resorted to cloning restric.ion fragmenls of viral DNA obteined 
from infected cells <"Hirt p rep »>. The fira, adenovirus we have sequenced in this way 
,s S,m,an Adenovirus. EcoRf digestion of ,he Simian Adenovirus yielded 7 
fragments, shotgun cloning yielded clones of ,he 5 internal fragments, which were 
cloned and seque^ed. Complefion of ,h. sequencing was carried ou, by walking 
towards each of me ends of me genome. The map of me genome la shown in Figure 



1. 

A 



Construction of Two PanS/Cl Chimeric Plasmids 
The overall approach towards constructing chimeric viruses was to 
first assemble the complete El deleted virus DNA into a single p.asmid flanked by 
recognition sites for the restriction enzyme Swal, digest the p.asmid DNA with Swal 
to release the virus DNA ends, and transfect the DNA into HEK293 cells to determine 
whether viable chimeric adenovirus could be rescued. Two chimeric virus plasmids 
were constructed, p5Cl short and P 5C1 long. 
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The plasmid pSClshort harbors an El deleted Pan5 virus where an 
internal 15226 bp segment (18332 - 33557) has been replaced by a functionally 
ana.ogous 141 27 bp (, 8531 - 32657) from AdCl . This results in the replacement of 
the Pan5 proteins hexon, endoprotease, DNA binding protein, 100 kD scaffolding 
proton, 33 kD protein, protein VIII, and fiber, as well as the entire E3 region with 
the homologous segment from AdCl. The Clal site at the left end of the AdCl 
fragment is at the beginning of the hexon gene and the resulting protein is identical to 
the CI hexon. The EcoRI site which constitutes the right end of the AdCl fragment 
is within the E4orf7partofthe AdCl. The right end was ligated to a PCR generated 
nght end fragment from AdPa„5 such that the regenerated orf 7-translation product is 
chimeric between AdPan5 and AdCl. 

The plasmid P 5C1 long harbors an El deleted Pan5 virus where an 
mternal 25603 bp segment (7955 - 33557) has been replaced by a functionally 
analogous 24712 bp (7946 - 32657) from AdCl . This results in the replacement of 
the AdPanS preterminal protein, 52/55 kD protein, penton base protein, protein VII 
Mu, and protein VI with those from AdCl in addition to those replaced in P 5Clshort 
The AscI site at the left end of the AdCl fragment is at the beginning of the DNA 
polymerase gene and results in a chimeric protein where the first 165 amino acids of 
the AdPanS DNA polymerase has been replaced by a 167 amino acid segment from 
AdCl DNA polymerase. In this N-terminal region, the homology between the 
AdPan5 and AdCl DNA polymerase proteins is 81% (72% identity). 

The plasmid pDVPSMlu which contains the left end of AdPanS was 
used as the starting plasmid for the chimeric vector construction. 

The plasmid pDVP5Mlu was made as follows. A synthetic DNA 
fragment harboring recognition sites for the restriction enzymes Smal, MIuI EcoRI 
and EcoRV respectively was ligated into P BR322 digested with EcoRI and Ndel so as 
to retain the origin of replication and the beta-Iactamase gene. The left end of Pan5 
extend.ng to the MIuI site (15135 bp) was cloned into this plasmid between the Smal 
and MIuI sites. The El gene was functionally deleted and replaced by a DNA 
fragment harboring recognition sites for the extremely rare cutter restriction enzyme 
s.tes I-Ceul and Pl-Scel). The 2904 base pairs of the right end of Pan-5 was PCR 
amplified using the primers P5L [GCG CAC GCG TCT CTA TCG ATG AAT TCC 
ATT GGT GAT GGA CAT GC, SEQ ID NO:7] and P5ITR [GCG CAT TTA AAT 
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z^ CA tr tat acc tca mc - seq ,d no:s) ^ ■ 

(Roche). The PCR product was cut with MU and Swal, and cloned between Mlul 
and EcoRV of pD VP5M,„ to yi a,d pPa„5M,u + RE. A 3193 bp flagmen, extending 
from the M, u ,sice(, 5 ,35) to the Cla, (18328) sireof Pa„5 wa S then inserted between 
the same S1 ,es of p Pa n5 MIu + RE,„ yield pPaoSCIa+RE. The 3671 bp Cla, (,853,) to 

^(^^".ofd.a^sc^^^ppja^^ 
Clal (161 1 1) and EeoRI (161 16) to yield pPanSC.delRI. The 10452 bp internal 
EcoR] fragment of the adenovirus CI (22202 - 32653) was cloned into the EcoRI site 

of pPa„5C.de,RI,oyie,dp5Cl s hor,.T„con a n,c,p5C.,on s , the AdCl replacement 
was further extended by replacing the AscI - Cla, ,0379 bp flagmen, of AdPan5 in 
pSC.ahort with the Add AscI - Cla, 10591 bp flagmen,. Finally a green fluorescent 
protetn (OFP) expression cassette was inserted into both P 5Clshort and pSC.long 
between me ,-Ceu, and Pl-Scel sites to yield pSClshortGFP and p5CI longQFP 
respectively. 

15 B. Rescue of chimeric Pan5/Cl recombinant vector adenoviruses 

The plasmids pSClshortGFP and pSCllongGFP were digested with 
the restriction enzy me Swal and transfected into HEK 293 cells. A typical adenovirus 
mduced cytopathic effect was observed. The rescue of recombinant chimeric 
adenovirus from the P 5C1 longGFP transfection was confirmed by collecting the 
supernatant from the transfection and re-infecting fresh cells which were found to be 
transduced as determined by GFP expression. Viral DNA prepared from the chimeric 
recombmant virus was digested with several restriction enzymes and found to have 
the expected pattern on electrophoresis (data not shown). 

The chimeric adenoviral construct with the shorter replacement 
pSClshort encodes the CI proteins hexon and fiber as well as the intervening open 
readmg frames for endoprotease, DNA binding protein, ,00 kDa scaffolding protein 
33 kDa protein, and protein VIII. (The E3 region is a.so 'included within this region 
but ,s unlikely to impact on the viability of the chimeric virus). When the 
replacement was extended to include the additional AdCl proteins pTP (preterminal 
protem), 52/55 kDa protein, penton base, protein VII, Mu, and protein VI, there was 
no d.fficulty in rescuing viable chimeric virus. In this experiment, the chimeric 
adenovirus construction strategy utilized the presence of AscI and Clal restriction 
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enzyme sites present on the genes for DNA polymerase and hexon respectively on 
both AdPan5 and AdCl. 

The reasons for the relatively higher yield of the chimeric virus 
compared to the wild-type AdCl virus are not clear. In the growth of the 5C1 
ch,meric virus in 293 cells, the adenoviral early region gene products of El and E4 
are derived from Ad5 and AdPanS respective.y. The El and E4 gene products bind 
reguiate and de-repress several cellular transcription complexes and coordinate their 
achv.ty towards viral multiplication. Thus it is possible that the El gene products 
supplied in irons from the 293 cells and the E4 gene products from AdPanS are more 
opumal in the human 293 cell background than are the equivalent AdCl gene 
products. This may also apply to the activity of the major late promoter whose 
acfv.ty is responsible for the transcription of the capsid protein genes. In the 
ch,meric virus, the major late promoter, and the protein IVa2 which transactivates it 
are derived from AdPanS. However the E2 gene products required for adenoviral ' 
DNA replication pTP and single-stranded DNA - binding protein are derived from 
AdCl. The adenoviral DNA polymerase, which complexes with P TP, is chimeric in 
Ad5Cl but mostly AdPan5 derived. 

Two plasmids have been constructed where the structural proteins 
denve from the chimpanzee adenovirus Pan 7 and Simian adenovirus 25 respectively 
and the flanking sequences are derived from human Ad5 (the commonly used vector ' 
stram). The Adhu5-Pan7 chimeric adenovirus has been rescued, demonstrating that 
the ch.meric virus construction method used to derive the Pan 5 - CI chimeric virus 
» broadly applicable. Fig. 2 provides the map of the recombinant Adhu5-SV25 
chuneric virus. The portion of the genome replaced by DNA from Pan7 is indicated. 

Example 2 - Pan5 - C1 chimeric vector of invention as a delivery vehicle 
for immunogenic compositions 

A Pan 5 (Simian adenovirus 22, a subgroup E adenovirus, also termed C5) 
- CI (S,m.an adenovirus 2 1 , a subgroup B adenovirus) chimeric expressing the Ebola 
virus (Zaire) glycoprotein (C5C1-CMVGP) was constructed as a model antigen in 
order to test the efficacy of the vector C5C1-CMVGP as a vaccine; this vector has 
been compared it to the AdhuS based vector (H5-CMVGP). Compared to H5- 
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CMVGP, *« C5C-CMVGP vector yielded o„ Iy a slig h tl y decreased .eve, of OP 
expression in transduced A549 cells. 

Thereafter, GP-specific T cell and B cel. responses elicited in B10BR mice 
vaccinated with either H5-CMVGP or C5C1-CMVGP vectors were compared The 
C5C1 .CMVGP vector appeared to induce lower frequencies of gamma interferon 
producing CD8+ T cells with kinetics slower than the H5-CMVGP vector as 
determined by intracellular cytokine staining using a H-2k restricted GP-specific 
pept.de as stimulant. The total IgG response to GP, measured by ELISA was 
equ.valent in serum from mice vaccinated with the C5C1 -CMVGP or the H5 
CMVGP vectors. However, the CSC 1 -CMVGP vector induced a more potent Thl 
type response while the H5-CMVGP vector stimulated a more balanced Th,/Th2 type 



response. 



Example 3 - Generation of Chimeric Pan6/Pan7 Vectors 

A panel of GFP expressing vectors were generated. This panel includes 
vectors that are chimeric between Pan 6 and Pan 7 where (a) the hexon protein of Pan 
7 was replaced by that of Pan 6 (termed C767), (b), the fiber protein of Pan7 was 
replaced by that of Pan 6 (termed C776), (c) both the hexon and fiber proteins of the 
Pan 7 vector have been replaced by those from Pan 6 (termed C766) 

The chimeric virus termed C767 was constructed essentially as described 
above for the C5C1C5 virus in Example , . However, due to substantia, homology 
between the Pan6 and Pan7 sequences 5' to the hexon sequence, it was not necessary 
to substitute the 5' end of the genome between the penton and the pol gene. 

The chimeric vector C767 was compared to the C776, C766, the parent C6, 
and the parent C7, each expressing GFP. 

Balb/C mice (25 per group) were immunized intramuscularly with either 
Pan 6 or Pan 7 (1 0'» particles). Re-administration (10" particles i.v., by tail vein 
-njection) was attempted 3 weeks .ater using each of the five GFP expressing vectors 
(C6-GFP, C7-GFP, and the three chimeric vectors). Three days later the level of liver 
transduction was estimated qua.itative.y by examining liver sections for the presence 
of GFP expression and quantitatively by estimating copies of GFP DNA by Taqman 
analysis. Administration of either one of the two chimpanzee adenovirus vectors does 
not affect the transduction efficiency of the other vector, while re-administration of 
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the same vector is severly compromised. The data showed that antibodies to both 
hexon and fiber are important in preventing re-administration of adenoviral vectors. 

5 All publications cited in this specification are incorporated herein by 

reference. While the invention has been described with reference to a particularly 
preferred embodiment, it will be appreciated that modifications can be made without 
departing from the spirit of the invention. Such modifications are intended to fall 
within the scope of the appended claims. 
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What is claimed is: 



1 • A method of efficiently culturing a chimeric adenovirus in a selected host 
cel., sa,d chimeric adenovirus being from a parental adenovirus strain incapable of 
efficient growth in said host cell, said method comprising the steps of: 
(a) generating a chimeric adenovirus comprising: 

(i) adenovirus sequences of the left terminal end and 
nght terminal end of a first adenovirus which grows in a selected host cell type said 
left end region comprising the 5' inverted terminal repeat (ITRs), and said right end 
reg,on comprising the 3' inverted terminal repeat (ITRs); and 

(ii) the internal regions from a parental adenovirus 
which lacks its native 5' and 3' terminal regions, said internal regions comprising the 
late genes encoding the penton, hexon, and fiber; 

wherein the resulting chimeric adenovirus comprises, from 5' 
to 3', a left terminal region of the first adenovirus, the internal region of the parental 
adenov,rus, and the right terminal region of the first adenovirus; and 

b) culturing said chimeric adenovirus in the presence of functional 
adenovirus Ela, Elb, and E4 ORF6 genes from the first adenovirus or from an 
adenovirus serotype which transcomplements the first adenovirus, and further in the 
presence of necessary adenoviral structural genes from the left end of the adenovirus. 

2. The method according to claim 1, wherein the internal region of the 
parental adenovirus further comprises one or more functional adenovirus genes 
selected from the group consisting of Endoprotease open reading frame, DNA 
binding protein, 100 IcDa scaffolding protein, 33 kDa protein, protein VIII, pTP, 
52/55 kDa protein, protein VII, Mu and protein VI. 

3. The method according to claim 1, wherein the polymerase, terminal 
protein and 52/55 kDa protein functions are provided in trans. 
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4. The method according to claim 1, wherein the first adenovirus further 
comprises the polymerase, terminal protein and 52/55 kDa protein functions. 

5. The method according to claim 1, wherein the chimeric adenovirus 
comprises the adenoviral late genes 1, 2, 3, 4, and 5 of the parental adenovirus. 

6. The method according to claim 1, wherein the selected host cell stably 
contains one or more of the adenovirus Ela, Elb or E4 ORF6 functions. 

7. The method according to claim 1, wherein the chimeric adenovirus 
comprises one or more of the adenovirus Ela, Elb or E4 ORF6 of the first 
adenovirus. 



8. The method according to claim 1, wherein the first adenovirus is of 
human origin. 



9. The method according to claim 1, wherein the first adenovirus is of 
simian origin. 

10. The method according to claim 1, further comprising the step of isolating 
the chimeric adenovirus. 

1 1 . A method for generating a chimeric adenovirus for growth in a selected 
host cell, said chimeric adenovirus being derived from a parental adenovirus strain 
mcapable of efficient growth in said host cell, said method comprising the step of 
generating a chimeric adenovirus comprising: 

5' and 3' terminal regions of a first adenovirus which grows in a 
selected host cel. type, said 5' terminal regions comprising the 5' inverted terminal 
repeat (iTRs) and necessary El gene functions, and said 3' terminal regions 
comprising inverted terminal repeat (ITRs) and necessary E4 gene functions; and 

internal regions from a parental adenovirus which lacks its native 5' 
and 3' terminal regions, said internal regions comprising the hexon, penton base and 
fiber; 
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wherein the resulting chimeric adenovirus comprises, from 5' to 3' the 5' 
terminal region ofthe first adenovirus, the internal region ofthe parental adenovirus 
and the 3 terminal regions of the first adenovirus. 

12. A chimeric adenovirus produced according to the method of claim 1 . 

13. A chimeric adenovirus comprising a hexon protein of a selected 
adenovirus serotype which is incapable of efficient growth in a selected host cel., 
said modified adenovirus comprising: 

(a) adenovirus sequences ofthe left terminal end of a first 
adenovirus which grows in a se.ected host eel, type, said left end region comprising 
the Ela, Elb and 5' inverted terminal repeat (ITRs); 

(b) adenovirus sequences ofthe internal region ofthe selected 
adenovirus serotype which is incapable of efficient growth in the selected host cell 
said interna, region comprising the genes encoding the penton, hexon and fiber ofthe 
selected adenovirus; 

(c) adenovirus sequences ofthe right terminal end ofthe first 
adenovirus, said right end region comprising the necessary E4 gene functions and the 
3 inverted terminal repeat (ITRs), 

wherein the resulting chimeric adenovirus comprises 
adenoviral structural and regulatory proteins necessary for infection and replication. 

14. The chimeric adenovirus according to claim 13, wherein the chimeric 
adenovirus further comprises the Ilia, 52/55kDa and terminal protein (pTP) ofthe 
selected adenovirus serotype. 

15. The chimeric adenovirus according to claim 13, wherein chimeric 
adenovirus comprises the polymerase ofthe first adenovirus. 

16. The chimeric adenovirus according to claim 13, wherein the chimeric 
adenovirus expresses a functional chimeric protein formed from the first adenovirus 
and the se.ected adenovirus, said chimeric protein is selected from the group 
consisting of polymerase, terminal protein, 52/55 kDa protein, and Ilia 
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17. The chimeric adenovirus according to claim 13, wherein the chimeric 
adenovirus comprises the terminal protein, 52/55 kDa, and/or Ilia of the selected 
adenovirus. 

18. A host cell comprising a chimeric adenovirus according to claim 12. 

19. The host cell according to claim 18, wherein said host cell is a human 

cell. 



20. An isolated simian adenovirus nucleic acid sequence selected from the 
group consisting of: 

(a) SA18 having the sequence of nucleic acids 1 to 31967 of SEO ID 

NO: 12 and 

(b) a nucleic acid sequence complementary to the sequence of any of 

(a)to(f). 



21. An isolated simian adenovirus serotype nucleic acid sequence selected 
from one or more of the group consisting of: 

(a) 5' inverted terminal repeat (ITR) sequences; 

(b) the adenovirus Ela region, or a fragment thereof selected 
from among the 13S, 12S and 9S regions; 

(c) the adenovirus Elb region, or a fragment thereof selected 
from among the group consisting of the small T, large T, IX, and IVa2 regions; 

(d) the E2b region; 

(e) the LI region, or a fragment thereof selected from among 
the group consisting of the 28.1 kD protein, polymerase, apoprotein, 52/55 kD 
protein, and Ilia protein; 

(0 the L2 region, or a fragment thereof selected from the 
group consisting of the penton, VII, VI, and Mu proteins; 

(g) the L3 region, or a fragment thereof selected from the 
group consisting of the VI, hexon, or endoprotease; 

(h) the 2a protein; 
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(i) the L4 region, or a fragment thereof selected from the 
group consisting of the 100 kD protein, the 33 kD homolog, and VIII; 

0) the E3 region, or a fragment thereof selected from the 
group consisting of E3 ORF1, E3 ORF2, E3 ORF3, E3 ORF4, E3 ORF5 E3 ORF6 
E3 ORF7, E3 ORF8, and E3 ORF9; 

(k) the L5 region, or a fragment thereof selected from a fiber 

protein; 

(I) the E4 region, or a fragment thereof selected from the 
group consisting of E4 ORF7, E4 ORP6, E4 ORF4, E4 ORF3, E4 ORF2 and E4 
ORFl;and 

(m) the 3' ITR, of any of SA18 SEQ ID NO: 12, or a sequence 
complementary to any of (a) to (m). 



22. A simian adenovirus protein encoded by the nucleic acid sequence 
according to claim 21. 



23. A composition comprising a simian adenovirus capsid protein according 
to claim 22 linked to a heterologous molecule for delivery to a selected host cell. 

24. A method for targeting a cell having an adenoviral receptor comprising 
delivering to a subject a composition according to claim 23. 

25. A nucleic acid molecule comprising a heterologous simian adenoviral 
sequence according to claim 21. 



26. The nucleic acid molecule according to claim 25, wherein said simian 
adenoviral sequence encodes an adenoviral gene product and is operatively linked to 
regulatory control sequences which direct expression of the adenoviral gene product 



in a host cells. 



27. The nucleic acid molecule according to claim 25, wherein said simian 
adenoviral sequence comprises the Ela region of SA18 SEQ ID NO: 12. 
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28. A pharmaceutical composition comprising the nucleic acid molecule 
according to claim 27 and a physiologically compatible carrier. 

29. A recombinant adenovirus having a capsid comprising a protein selected 
from the group consisting of: 

(a) a hexon protein of SA18, SEQ ID NO 13, or a unique fragment 

thereof; 

(b) a penton protein of SA1 8, SEQ ID NO: 14, or a unique fragment 

thereof; 

(c) a fiber protein of SA 1 8, SEQ ID NO: 1 5, or a unique fragment 

thereof. 



30. The recombinant adenovirus according to claim 29, wherein the capsid is 
of an artificial serotype. 



31. The recombinant adenovirus according to claim 29, wherein said virus 
further comprises a heterologous gene operatively linked to sequences which direct 
expression of said gene in a host cell. 

32. The recombinant adenovirus according to claim 29, further comprising 
5' and 3' adenovirus cis-elements necessary for replication and encapsidation. 

33. The recombinant adenovirus according to claim 29, wherein said vector 
lacks all or a part of the El gene. 

34. A host cell comprising a heterologous nucleic acid molecule comprising 
the nucleic acid sequence according to claim 2 1 . 

35. The host cell according to claim 34, wherein said host cell is stably 
transformed with the nucleic acid molecule. 
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36. The host cell according to claim 34, wherein said host cell expresses one 
or more adenoviral gene products from said nucleic acid molecule, said adenoviral 
gene products selected from the group consisting of Ela, Elb, E2a, and E4 ORF6. 

37. The host cell according to claim 34, wherein said host cell is stably 
transformed with a nucleic acid molecule comprising the simian adenovirus inverted 
terminal repeats. 

38. A composition comprising a recombinant virus according to claim 29 in 
a pharmaceutically acceptable carrier. 

39. A method for delivering a heterologous gene to a mammalian cell 
comprising introducing into said cell an effective amount of the recombinant virus 
according to claim 29. 

40. A method for repeat administration of a heterologous gene to a mamma, 
comprising the steps of: 

(a) introducing into said mammal a first vector which 
comprises the heterologous gene and 

(b) introducing into said mammal a second vector which 
comprises the heterologous gene; 

wherein at least the first virus or the second vector is a 
virus according to claim 29 and wherein the first and 
second recombinant vector are different. 

4 1 . A method for producing a selected gene product comprising infecting a 
mammalian cell with the recombinant virus according to claim 29, cu.turing said cell 
under suitable conditions and recovering from said cell culture the expressed gene 
product. 



42. A method for eliciting an immune response in a mammalian host against 
an mfective agent comprising administering to said host an effective amount of the 
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recombinant adenovirus of claim 29, wherein said heterologous gene encodes an 
antigen of the infective agent. 

43. The method according to claim 42, comprising the step of priming the 
host with a DNA vaccine comprising the heterologous gene prior to administering 
the recombinant adenovirus. 
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ABSTRACT OF THE DISCLOSURE 



A method for providing an adenovirus from a serotype which does not grow 
efficently in a desired cel. .ine with the ability to grow in that ce.l line is described 
The method involves replacing the left and right termini of the adenovirus with the " 
corresponding termini from an adenovirus which grow efficiently in the desired cell 
Ime. At a minimum, the left terminus spans the 5' inverted terminal repeat, the left 
terminus spans the E4 region and the 3' inverted terminal repeat. The resulting 
ch,meric adenovirus contains the internal regions spanning the genes encoding the 
penton, hexon and fiber from the serotype which does not grow efficiently in the 
desn-ed ce.l. Also provided are vectors constructed from novel simian adenovirus 
sequences and proteins, host cells containing same, and uses thereof 
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<U0> Wilson, James M. 
Roy, Soumitra 

<120> SSSc°LSSSS ChimeriC A< * enov * ruses and uses For Such 
<130> UPN-P3067 
<160> 15 

<170> Patentln version 3.2 

<210> l 

<211> 36462 

<212> DNA 

<213> chimpanzee adenovirus serotype Pan5 

<400> 1 



catcatcaac aatatacc.c aaacttttgg t g c 9 c gttaa tatgcaaatg a 99tatttg a 
atttggggat 9 c ggggc9gt gattgg c tgc g93 a g c ggcg acc g tt aggg 9 c ggggcggg 
tg-egttttg a t gacgtgg c c 9 t gagg c gg ag ccgg tttg c aagttctcg tggga aaa g t 
gaogtcaaac gaggtg tggt ttgaacacgg aaatactcaa tttteccgcg ctctctgaca 
99aaa t9a99 tgtttctggg. c 9gatgcaa9 tgaaaacggg ccattttcgc gcgaaaactg 
aatg aggaag tgaaaat c tg a gt aattcc g c gtCtatggc agggaggagt atttgccgag 
99cc ga3tag actttgaccg attacgtggg ggtctcgatt accgtatttt tcacctaaat 
t t cc g cg Cac ggtgtcaaag tccggtgttt ttacgtaggt gtcagctgafc cgccagggta 
tt taaacctg c gctCt c t a g tcaagaggcc a ctc ttgagt gc c agcgagt agagttttct 
cc t cc 9 cgcc gcga gtcag a tctacacttt g a aagatgag gcacctgaga gacctgcccg 
gftgtttt cctggctact ggg aac g a g a t tc t gg aact gg tggtggac gccatgatgg 
gtgacgaccc t cc ggagccc cctaccccat ttgaagcgcc t tcg c tgtac gatttgtatg 
atc tggaggt ggatgcgccc gagaacgacc ccaacgagga ggcgg C gaat gatttgttta 
9cga, g cc g c gctgctggct gccgagcagg ct a atac gga ctc tgg ctca gacagcgatt ' 
cctctctcca t accccgag a cccgg Ca g ag gtgagaaaaa gat ccccg a g ctt aaa gggg 
aa gagct c 9a cctgcgctgc t a t gagg aat gcttgcctcc gagcgatg a t gaggaggacg 
aggaggcgat tcgagctgca gcgaaccagg ga gtg aaaac a gcg a g c 9 a g gg ctt tagc c 
t gg actg tC c tactctgccc ggacacggct g taagtcttg tgaatttcat cgcatgaata 
ctggagataa 9 aat gtgatg tgtgccctgt gctatatgag agcttacaac c att g tgttt 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



1 



.=a 9 taa 9tg t 9attaactt tagotgggga Mcagagggt sactmatgc 
.« t « 9C . t « gt „ ttt . ^ cc9tctct9<> cgtagat9a3 ^ 
ca 9 a 9t9 ca C ttcatcaceo cca9 .„ tt? gcgagga _ goccsawt attactcata 
9 acc. 9Ctg c agcgagagtc accgggcgta gagcagct3t ^ wtgacttgc 
"o. 999 t 99 ^..^ et39a „ tgt gcaeccggaa acgccccj3g ^^^^^^ 
c.caca Cg t 9 tgtttactta aggtgatgtc agtacttata gggtgt3gag 
««. agtgc gtggtttatg actcaggggt ggggact9tg 

C ° t9t9t39t • — «-« 99 . 9 «c t99 a „ gtottgg 

aa 9 a« tt „ ccagactaga cagctgctag 

99 a 9 . tt c t9 « tcggtggg „ tctagcta agctagteta tagg9oc>a3 
a 99 at=.,t t tgaggatact ctgagagagt gtcctggtat ttttgacece 
9 ce. t c. 9 tc t ca=« taac cagagtattc tgagagc<;ct tgacetctcc actcctggca 
^ -ttBec ttttttgcc[ ttatccttga caaacggagt ^ 
a tCt oa 9 oa 9 ctggactg „ 

gaacg „ atc tccggcta „ tgcc . gcaca gccggtagac ^ 
ccc t9 a 9t « eeagtcacct: caggaacac£ aacgccgcca 
• 9 aa. 9 . 99 . 9gac<:g . gaa gag „ c „ ga gagccggtc£ 

a 99 . 99 a 9 t. 9 c tga c ttgt ttcccgagct gcgccgggtg ccgactaggt 
acgggagagg 99gattaagc gggagaggc< tgaggagace 
^. gC « g atgagtcgc . ggcgcacaga atcggtgtgg 
9999 a t , 9 , t 9aggt « cag cgatgcatg , gaaatacecc ctagMcMg 

ta 9gCt9gag ccc gaggatg attgggaggc agccaccagg aateatgoca agctggctct 
9 a 99 cc agae aagaagtaca agateaacaa actgaetaae g 
«c. g99aat ggggccgagg tggagaecag -^^^ 

~ t9 at 9aae atgeM<!cgg gggtggtggg caeggaggg> gtcacctcta cgaacgcgag 
gatgggtata atggggtggt ctttatggcc ^ ^ 

e "' t9 " CC ■ 9 « tg5OTC „ gtgttteage 

9 . 9999 a t9 c agettttcag „ aaccggac gggggtegcg ggcagaaocii >g ^ ^ 
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^ a " t9CCC9t ~ ~—» * M9 = 9 ., 9 c 

— «~ cachet ctaocs.,,0 99gccgctct gtactgatea 
ca.a 9t caa 9 cataatatga t „ gt9M9c cic99atgaa _ 

" 9 " C * 3 " — 
9 — 99 c« 9 , 9ttcg . gc ac .. cstc „ g>cccgct3c , itgtgcocc ^ 

co 9 , 99 c« 9 tccat9cccc acca9ta _ catgcaaett gcg>agg ^ ^ 

c 9 « 9 co. t9 tccagagcga gcctgac9gg acgaatgtag 

— 9 a 9 a t . tgatgaat coaa9 , cc . g gtgccgggco tgcgaaegcg g ^ 

« „ 9cccgcgt gcgt9ga9gt gaeggaggac ^ 

9 «*~t 9 e .ac 99ga09g agttC9gcto cagcggggaa gaatctwt 

9t9 « c999 . ct999t99ga gcotgcatga tgggcagaat gactaaaatc 

' WS " S C " 9a9C99 ' -~-« 99C a t c=a 9 c e« tat « g a 

■Mm. ccc ct « t99 9C999a9tgc 9tca9aacgc gatgggatcc ^ 

9CC " CCC9t 9CS9 ^ ^ . 9ct e= C c 9t 

~ 9c . agcC9co9cc 9cagcegctg cttccgecgc 

«c C999=go cggccacca= agctctctgg tggccaactc ^ 

cc 9 oc C9 „ wg9 , 9aag „ 9CC9oegc gctc ^ 

9 cc t999 c 9a 9ctgacccag cag9t99ccc ggagacgcgg 

a,« oa a ataa aaa . tg ,, tc „ taaataM eggagacggt tgttgac£cc 

— 9 a 9 t c« 9a , tctc tatetgactt tecgcgcgcg gtaggccctg ^ 

""""" 9a9 " CCC " MOT. 9 « C99atgc 

tmu. 999 c. tgagc cogtcoc999 99t9gaggta gctccateg= 

99tgttgtaa atcaccoagt wwaggg gegcagggcg t9geg ^ 
9 a 9ga9gag . otgat99eca cgggcagcca ctcg9t9tag gtgtegacga 
.oc t9tt9 a 9 c tgggaggga t9catgcgg9 ^ 

t 9 a 9 aa t99 c 9aegetcccg „ cagacccc ^ 

9C * C99C9 " tCC " t9 "= ' W <~* aa 99 c 9t9aa 
. 9 a.tt t9ga 9aogc „ ct9 tgacc9ceca ggttttecat gcactc>tcc atgatgatgg 

9tgg9cg9cg 9ctt99g _ agaagtttcg ggggtcggac ^ 
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OT9wtcg tcataggc „ tct9539cag i33gi3occg 

9 "" t9CCC tC9 ""^ «■•■«■» 9~ctc 9 ea 9 a t ct 9 catc t 
ccc.^cctt 9 a 9 «c ggag gggggg „ oa t9tccaoct9 c9g9gcgKg ^ 

'««^«99«.t»- w t s99 cc sl aaa 9 ca 99 tt -cane*. t 999 a« tg a 

^ " 39CC3t * 9 *«»=«9 tt9a999a9I 

S .c wt9 cc 9t cctc 9 = 99 a 99 . 99g9g9 ccaocccgtt catc „ etcg 

t-.ttcae** a.a gagttcc gccaggaggc gc „ 9ccccc aagcg>gagg agctct ^ 

•KM. 9ttttccagc ggcttgagcc agtcggcaat ggg<;aetttg gs9 ^ t 

9« g=aagag ltccagacgg tccca9a9ct cggtgatgtg cccc ^ 
9 ca 9 ac=cc= tc9 « t c 9 a 9 ggtCggggcg accgc9gg , g 

9t ca. 9 c 9ag 9 c cagggtcc 99 tcc tt cca 9g99cg „ gg 9tccgcgtea gcgtggtctc 
Ctcac^ aag9ggegcg CBCOg99ces 9sogctcgcg agggtgcgce ^^^^ 

CW * 9a9 " CC9Ct — «— « tc^cca^t . 9c ,. tegag 

" 9tt9 * 9C9 !" MMC t acct« 99 , 

t. 9 . 9ctt9g 

cu, 9 o. 9 a C99=9 cagacggtct . 

^ — 9 a 99 « eoctcc9t gccttcegat 
a« ctg9Cct cc.t 9 . 9 «c 9C9Cc ccc 9 a t9g9tgacaa . gag9ctgtc 
c 9 t 9t cccc 9 tagaccgacc ttat^ca, 9tC a t c 9 a 9 c 999gCgocgc ggteot<!9t<! 
9 " 9 a 99 .. c CCC9 « cact ocga9acgaa cmm ^ 

C " 9t9 "' 9 999t - 9C99t — a«««cc. . 999e . t9caa 

9 =.o.c 9t « ccc^tcc. catcca,,.. ^..^ 

aocM^tc ^wccsms 999 tataaa. 9999gcg9gc ccct^tc* coac.c t9t c 
tccc 99 . t c 9 c C9C co. 99a 9cgc<:a9 « 9 ttg999ta9g tattcoctcE C9aasgcggg 
ca^aactc, 9 ca«ca 99 t t9tcagttlc tagaaaC9 . g g . g9attcga 

gccatnm wttte . tgagccecCc gtccatctgg ^ g ^ 

9t t 9tC9 a 9 c tt9geg9cga .^.^ gagggcgttg 

t99ttotttt ccetgtc9ge gcgctccttg g<;ggagacgt 
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9taCtC9C9C 9CCaC9Ca " tC " ttC ^ *^ac ggtg gtgagcttgt cgggcacgat 
t c tg accc 9 c ca 9 cc 3 c ggt tgtgcagggt gatgaggtcc 

9c 9 ca gggg c tC9ttggt cc a g ca 3aggcg ccc 9 ccc ttg cgcgagcaga aggggggcag 
c 9ggt cca 9 c a t9agctcgt cgggggggtc ggcgtccacg 

ctc 9999tcg aa9tagctga tgcaggfcgcc cagatcgtcc agcgccgctt gccagtcgcg 

cac g9 cca gc 9 c 9 c 9 c t c 9t aggggctgag gggcgtgccc ca 
c 9 c gg a ggcg tacatgccgc agatgtcgta gacgtagagg ^^^^ 
9t a 99tgggg tagcagcgcc ccccgcggat gctggcgcgc acgfcagtcgt 
c 9a3g3 c gcg aggagcccgg tgccgaggtt ggagcgcfcgc ^^^^^ ^^^^ 
9a t c tggc99 aagatggcgt gggagttgga ggagatggtg ggccfcctgga agatgtfcgaa 
9t 9 39 c 9tgg ggcagtccga ccgagtcccfc gatgaagtgg gcgtaggagtcctgcagctt 
99c g ac gagc Ccg9 c ggtga cgaggacgtc cagggcgcag 

9at9tC9t3C tt9a9Ct " C — 9 c tC9 c 9gttgagaa ggaactcttc 
9c ggt cc ttc ca gtact ct t c gagggggaa cccgtcctga tcggcacggt 
ca t9tagaac cggttgacgg ccttgtaggc gcagcagccc ttctccacgg 

a 3 c ttgcg c 9 gccttgcgca gggaggtgtg ggtgagggcg aaggtgtcgc gcaccacgic 
c ttg a ggaac tggtgcttga agtcgaggtc gtcgcagccg ^^^^^ agagctg ^ 

9 t cc gtgcgc ttcttgt a 9g cggggttggg caaagcgaaa gtaacatcgt tgaagaggafc 
c ttg ccc gcg cggggcatga agttgcgagt gatgcggaaa ggcfcggggca ^^^^^ 
9 ttgttgatg acctgggcgg cgaggacgat ctcgtcgaag ccgttgatgfc ^^^^^ 

9at gt a gagt tccac 9 aatc g c ggg c 9g cc c ttgacgtgg gg ca g c ttct t 9 a g c t c g tc 
9 t a 9gtgagc tcggcggggt cgctgaggcc gtgctgctcg agggcccagt 

99 99 t t33 c 3 cc 9a9gaagg aagtccagag afcccacggcc agggcggtct gcaagcggfcc 

cc 99 tac tga cggaactgct g9 cccac gg c cattttttcg gggg , gacgc agtagaaggt 
9c 999ggt c 9 CC9tgccagc ggtcccactt gagctggagg gcgaggtcgt • 
9ac 3 a g c ggc gggtccccgg agagttfccat gaccagcafcg gctgctfcgcc 

9aa gg acccc a t cca 3gtgt aggttt ccac gtcgtaggtg aggaagagcc tttcggCgcg 
a99a tgCgag ccgatgggga agaactggafc 

9at gtgatgg aagtagaaat gccgacggcg cgccgagcac tcgtgcttgt gfcttatacaa 
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gcgtccgcag tgctcgcaac gctgcacggg atgcacgtgc tgcacgagct gtacctgggt 
tcctttgacg aggaatttca gtgggcagtg gagcgctggc ggctgcatct ggtgctgtac 
tacgtcctgg ccatcggcgt ggccatcgtc tgcctcgatg gtggtcatgc tgacgaggcc 
gcgcgggagg caggtccaga cctcggctcg gacgggtcgg agagcgagga cgagggcgcg 
caggccggag ctgtccaggg tcctgagacg ctgcggagtc aggtcagtgg gcagcggcgg 
cgcgcggttg acttgcagga gcttttccag ggcgcgcggg aggtccagat ggtacttgat 8340 
ctccacggcg ccgttggtgg cgacgtccac ggcttgcagg gtcccgtgcc cctggggcgc 
caccaccgtg ccccgtttct tcttgggtgc tggcggcggc ggctccatgc ttagaagcgg 
cggcgaggac gcgcgccggg cggcaggggc ggctcggggc ccggaggcag gggcggcagg 
ggcacgtcgg cgccgcgcgc gggcaggttc tggtactgcg cccggagaag actggcgtga 8580 
gcgacgacgc gacggttgac gtcctggatc tgacgcctct gggtg aa g 9C cacgggaccc 
gtgagtttga acctgaaaga gagctcgaca gaatcaatct cggtaccgtt gacggcggcc 
tgccgcagga tctcttgcac gtcgcccgag ttgtcctggt aggcgatctc ggtcatgaac 
tgcccgatct cctcctcctg aaggtctccg cgaccggcgc gctcgacggt ggccgcgagg 
tcgttggaga tgcggcccat gagctgcgag aaggcgttca tgccggcctc gttccagacg 
cggctgcaga ccacggctcc gtcggggtcg cgcgcgcgca tgaccacctg ggcgaggttg 
agctcgacgt ggcgcgtgaa gaccgcgtag ttgcagaggc gctggtagag gtagttgagc 
gtggtggcga tgtgctcggt gacgaagaag tacatgatcc agcggcggag cggcatctcg 
ctgacgtcgc ccagggcttc caagcgctcc atggcctcgt agaagtccac ggcgaagttg 
aaaaactggg agttgcgcgc cgagacggtc aactcctcct ccagaagacg gatgagctcg 
gcgatggtgg cgcgcacctc gcgctcgaag gccccggggg gctcctcttc ttccatctcc 9240 
tcctcctctt ccatctcctc cactaacatc tcttctactt cctcctcagg aggcggcggc 9300 
gggggagggg ccctgcgtcg ccggcggcgc acgggcagac . ggtcgatgaa gcgctcgatg 9360 
gtctccccgc gccggcgacg catggtctcg gtgacggcgc gcccgtcctc gcggggccgc 9420 
agcgtgaaga cgccgccgcg catctccagg tggccgccgg gggggtctcc gttgggcagg 9480 
gagagggcgc tgacgatgca tcttatcaat tggcccgtag ggactccgcg caaggacctg 9540 
agcgtctcga gatccacggg atccgaaaac cgctgaacga aggcttcgag ccagtcgcag 9600 
tcgcaaggta ggctgagccc ggtttcttgt tcttcgggta tttggtcggg aggcgggcgg 
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gcgatgctgc tggcgatgaa gttgaagtag gcggtcctga gacggcggat ggtggcgagg 9720 

agcaccaggt ccttgggccc ggcttgctgg atgcgcagac ggtcggccat gccccaggcg 9780 

tggtcctgac acctggcgag gtccttgtag tagtcctgca tgagccgctc cacgggcacc 9840 

tcctcctcgc ccgcgcggcc gtgcatgcgc gtgagcccga acccgcgctg cggctggacg 9900 

agcgccaggt cggcgacgac gcgctcggcg aggatggcct gctggatctg ggtgagggtg 9960 

gtctggaagt cgtcgaagtc gacgaagcgg tggtaggctc cggtgttgat ggtgtaggag 10020 

cagttggcca tgacggacca gttgacggtc tggtggccgg ggcgcacgag ctcgtggtac 10080 

ttgaggcgcg agtaggcgcg cgtgtcgaag atgtagtcgt tgcaggtgcg cacgaggtac 10140 

tggtatccga cgaggaagtg cggcggcggc tggcggtaga gcggccatcg ctcggtggcg 10200 

ggggcgccgg gcgcgaggtc ctcgagcatg aggcggtggt agccgtagat gtacctggac 10260 

atccaggtga tgccggcggc ggtggtggag gcgcgcggga actcgcggac gcggttccag 10320 

atgttgcgca gcggcaggaa gtagttcatg gtggccgcgg tctggcccgt gaggcgcgcg 10380 

cagtcgtgga tgctctagac atacgggcaa aaacgaaagc ggtcagcggc tcgactccgt 10440 

ggcctggagg ctaagcgaac gggttgggct gcgcgtgtac cccggttcga gtccctgctc 10500 

gaatcaggct ggagccgcag ctaacgtggt actggcactc ccgtctcgac ccaagcctgc 10560 

taacgaaacc tccaggatac ggaggcgggt cgttttggcc attttcgtca ggccggaaat 10620 

gaaactagta agcgcggaaa gcggccgtcc gcgacggctc gctgccgtag tctggagaaa 106B0 

gaatcgccag ggttgcgttg cggtgtgccc cggttcgagc ctcagcgctc ggcgccggcc 10740 

ggattccgcg gctaacgtgg gcgtggctgc cccgtcgttt ccaagacccc ttagccagcc 10800 

gacttctcca gttacggagc gagcccctct ttttcttgtg tttttgccag atgcatcccg 10860 

tactgcggca gatgcgcccc caccctccac cacaaccgcc cctaccgcag cagcagcaac 10920 

agccggcgct tctgcccccg ccccagcagc agcagccagc cactaccgcg gcggccgccg 10980 

tgagcggagc cggcgttcag tatgacctgg ccttggaaga gggcgagggg ctggcgcggc 11040 

tgggggcgtc gtcgccggag. cggcacccgc. gcgtgcagat gaaaagggac gctcgcgagg 11100 

cctacgtgcc caagcagaac ctgttcagag acaggagcgg cgaggagccc gaggagatgc 11160 

gcgcctcccg cttccacgcg gggcgggagc tgcggcgcgg cctggaccga aagcgggtgc 11220 

tgagggacga ggatttcgag gcggacgagc tgacggggat cagccccgcg cgcgcgcacg 11280 

tggccgcggc caacctggtc acggcgtacg agcagaccgc gaaggaggag agcaacttcc 11340 

aaaaatcctt caacaaccac gtgcgcacgc tgatcgcgcg cgaggaggtg accctgggcc 11400 
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tgatgcacct gtgggacctg ctggaggcca tcgtgcagaa ccccacgagc aagccgctga 11460 

cggcgcagct gtttctggtg gtgcagcaca gtcgggacaa cgagacgttc agggaggcgc 11520 

tgctgaatat caccgagccc gagggccgct ggctcctgga cctggtgaac attctgcaga 11580 

gcatcgtggt gcaggagcgc gggctgccge tgtccgagaa gctggcggcc atcaacttct 11640 

cggtgctgag cctgggcaag tactacgcta ggaagatcta caagaccccg tacgtgccca 11700 

tagacaagga ggtgaagatc gacgggtttt acatgcgcat gaccctgaaa gtgctgaccc 11760 

tgagcgacga tctgggggtg taccgcaacg acaggatgca ccgcgcggtg agcgccagcc 11820 

gccggcgcga gctgagcgac caggagctga tgcacagcct gcagcgggcc ctgaccgggg 11680 

ccgggaccga gggggagagc tactttgaca tgggcgcgga cctgcgctgg cagcccagcc 11940 

gccgggcctt ggaagctgcc ggcggttccc cctacgtgga ggaggtggac gatgaggagg 12000 

aggagggcga gtacctggaa gactgatggc gcgaccgtat ttttgctaga tgcagcaaca 12060 

gccaccgccg cctcctgatc ccgcgatgcg ggcggcgctg cagagccagc cgtccggcat 12120 

taactcctcg gacgattgga cccaggccat gcaacgcatc atggcgctga cgacccgcaa 12180 

tcccgaagcc tttagacagc agcctcaggc caaccgactc tcggccatcc tggaggccgt 12240 

ggtgccctcg cgctcgaacc ccacgcacga gaaggtgctg gccatcgtga acgcgctggt 12300 

ggagaacaag gccatccgcg gcgacgaggc cgggctggtg tacaacgcgc tgctggagcg 12360 

cgtggcccgc tacaacagca ccaacgtgca gacgaacctg gaccgcatgg tgaccgacgt 12420 

gcgcgaggcg gtgtcgcagc gcgagcggtt ccaccgcgag tcgaacctgg gctccatggt 12480 

ggcgctgaac gccttcctga gcacgcagcc cgccaacgtg ccccggggcc aggaggacta 12540 

caccaacttc atcagcgcgc tgcggctgat ggtggccgag gtgccccaga gcgaggtgta 12600 

ccagtcgggg ccggactact tcttccagac cagtcgccag ggcttgcaga ccgtgaacct 12660 

gagccaggct ttcaagaact tgcagggact gtggggcgtg caggccccgg tcggggaccg 12720 

cgcgacggtg tcgagectgc tgacgccgaa ctcgcgcctg ctgctgctgc tggtggcgcc 12780 

cttcacggac agcggcagcg tgagccgcga ctcgtacctg ggctacctgc ttaacctgta i2840 

ccgcgaggcc atcgggcagg cgcacgtgga cgagcagacc taccaggaga tcacccacgt 12900 

gagccgcgcg ctgggccagg aggacccggg caacctggag gccaccctga acttcctgct 12960 

gaccaaccgg tcgcagaaga tcccgcccca gtacgcgctg agcaccgagg aggagcgcat 13020 

cctgcgctac gtgcagcaga gcgtggggct gttcctgatg caggaggggg ccacgcccag 13080 
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cgccgcgctc gacatgaccg cgcgcaacat ggagcccagc atgtacgccc gcaaccgccc 13140 

gttcatcaat aagctgatgg actacttgca tcgggcggcc gccatgaact cggactactt 13200 

taccaacgcc atcttgaacc cgcactggct cccgccgccc gggttctaca cgggcgagta 13260 

cgacatgccc gaccccaacg acgggttcct gtgggacgac gtggacagca gcgtgttctc 13320 

gccgcgcccc accaccacca ccgtgtggaa gaaagagggc ggggaccggc ggccgtcctc 13380 

ggcgctgtcc ggtcgcgcgg gtgctgccgc ggcggtgccc gaggccgcca gccccttccc 13440 

gagcctgccc ttttcgctga acagcgtgcg cagcagcgag ctgggtcggc tgacgcggcc 13500 

gcgcctgctg ggcgaggagg agtacctgaa cgactccttg cttcggcccg agcgcgagaa 13560 

gaacttcccc aataacggga tagagagcct ggtggacaag atgagccgct ggaagacgta 13620 

cgcgcacgag cacagggacg agccccgagc tagcagcagc accggcgcca cccgtagacg 13680 

ccagcggcac gacaggcagc ggggtctggt gtgggacgat gaggattccg ccgacgacag 13740 

cagcgtgttg gacttgggtg ggagtggtgg tggtaacccg ttcgctcacc tgcgcccccg 13800 

tatcgggcgc ctgatgtaag aatctgaaaa aataaaagac ggtactcacc aaggccatgg 13860 

cgaccagcgt gcgttcttct ctgttgtttg tagtagtatg atgaggcgcg tgtacccgga 13920 

gggtcctcct ccctcgtacg agagcgtgat gcagcaggcg gtggcggcgg cgatgcagcc 13980 

cccgctggag gcgccttacg tgcccccgcg gtacctggcg cctacggagg ggcggaacag 14040 

cattcgttac tcggagctgg cacccttgta cgataccacc cggttgtacc tggtggacaa 14100 

caagtcggcg gacatcgcct cgctgaacta ccagaacgac cacagcaact tcctgaccac 14160 

cgtggtgcag aacaacgatt tcacccccac ggaggccagc acccagacca tcaactttga 14220 

cgagcgctcg cggtggggcg gccagctgaa aaccatcatg cacaccaaca tgcccaacgt 14280 

gaacgagttc atgtacagca acaagttcaa ggcgcgggtg atggtctcgc gcaagacccc 1434 0 

caacggggtc acagtaacag atggtagtca ggacgagctg acctacgagt gggtggagtt 14400 

tgagctgccc gagggcaact tctcggtgac catgaccatc gatctgatga acaacgccat 14460 

catcgacaac tacttggcgg tggggcggca gaacggggtg ctggagagcg acatcggcgt 14520 

gaagttcgac acgcgcaact tccggctggg ctgggacccc gtgaccgagc tggtgatgcc 14580 

gggcgtgtac accaacgagg ccttccaccc cgacatcgtc ctgctgcccg gctgcggcgt 14640 

ggacttcacc gagagccgcc tcagcaacct gctgggcatc cgcaagcggc agcccttcca 14700 

99*gggcttc cagatcctgt acgaggacct ggaggggggc aacatccccg cgctgctgga 14760 

cgtggacgcc tacgagaaaa gcaaggagga tagcgccgcc gcggcgaccg cagccgtggc 14820 
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caccgcctct accgaggtgc ggggcgataa ttttgctagc gccgcgacac tggcagcggc 14880 

cgaggcggct gaaaccgaaa gtaagatagt gatccagccg gtggagaagg acagcaagga 14940 

gaggagctac aacgtgctcg cggacaagaa aaacaccgcc taccgcagct ggtacctggc 15000 

ctacaactac ggcgaccccg agaagggcgt gcgctcctgg acgctgctca ccacctcgga 15060 

cgtcacctgc ggcgtggagc aagtctactg gtcgctgccc gacatgatgc aagacccggt 15120 

caccttccgc tccacgcgtc aagttagcaa ctacccggtg gtgggcgccg agctcctgcc 15180 

cgtctactcc aagagcttct tcaacgagca ggccgtctac tcgcagcagc tgcgcgcctt 15240 

cacctcgctc acgcacgtct tcaaccgctt ccccgagaac cagatcctcg ttcgcccgcc 15300 

cgcgcccacc attaccaccg tcagtgaaaa cgttcctgct ctcacagatc acgggaccct 15360 

gccgctgcgc agcagtatcc ggggagtcca gcgcgtgacc gtcactgacg ccagacgccg 15420 

cacctgcccc tacgtctaca aggccctggg cgtagtcgcg ccgcgcgtcc tctcgagccg 15480 

caccttctaa aaaatgtcca ttctcatctc gcccagtaat aacaccggtt ggggcctgcg 15540 

cgcgcccagc aagatgtacg gaggcgctcg ccaacgctcc acgcaacacc ccgtgcgcgt 15600 

gcgcgggcac ttccgcgctc cctggggcgc cctcaagggc cgcgtgcgct cgcgcaccac 15660 

cgtcgacgac gtgatcgacc aggtggtggc cgacgcgcgc aactacacgc ccgccgccgc 15720 

gcccgtctcc accgtggacg ccgtcatcga cagcgtggtg gccgacgcgc gccggtacgc 15780 

ccgcgccaag agccggcggc ggcgcatcgc ccggcggcac cggagcaccc ccgccatgcg 15840 

cgcggcgcga gccttgctgc gcagggccag gcgcacggga cgcagggcca tgctcagggc 15900 

ggccagacgc gcggcctccg gcagcagcag cgccggcagg acccgcagac gcgcggccac 15960 

ggcggcggcg gcggccatcg ccagcatgtc ccgcccgcgg cgcggcaacg tgtactgggt. 16020 

gcgcgacgcc gccaccggtg tgcgcgtgcc cgtgcgcacc cgcccccctc gcacttgaag 16080 
atgctgactt cgcgatgttg atgtgtccca gcggcgagga ggatgtccaa gcgcaaattc 16140 
aaggaagaga tgctccaggt catcgcgcct gagatctacg gcccggcggc ggtgaaggag 16200 
gaaagaaagc cccgcaaact gaagcgggtc aaaaaggaca aaaaggagga ggaagatgtg 16260 
gacggactgg tggagtttgt gcgcgagttc gccccccggc ggcgcgtgca gtggcgcggg 16320 
cggaaagtga aaccggtgct gcgacccggc accacggtgg tcttcacgcc cggcgagcgt 16380 
tccggctccg cctccaagcg ctcctacgac gaggtgtacg gggacgagga catcctcgag 16440 
caggcggccg aacgtctggg cgagtttgct tacggcaagc gcagccgccc cgcgcccttg 16500 
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aaagaggagg cggtgtccat cccgctggac cacggcaacc ccacgccgag cctgaagccg 16560 
gtgaccctgc agcaggtgct gcctggtgcg gcgccgcgcc ggggcttcaa gcgcgagggc 16620 
ggcgaggatc tgtacccgac catgcagctg atggtgccca agcgccagaa gctggaggac 16680 
gtgctggagc acatgaaggt ggaccccgag gtgcagcccg aggtcaaggt gcggcccatc 16740 . 
aagcaggtgg ccccgggcct gggcgtgcag accgtggaca tcaagatccc cacggagccc 16800 
atggaaacgc agaccgagcc cgtgaagccc agcaccagca ccatggaggt gcagacggat 16860 
ccctggatgc cggcaccggc ttccaccacc cgccgaagac gcaagtacgg cgcggccagc 16920 
ctgctgatgc ccaactacgc gctgcatcct tccatcatcc ccacgccggg ctaccgcggc 16980 
acgcgcttct accgcggcta caccagcagc cgccgccgca agaccaccac ccgccgccgc 17040 
cgtcgtcgca cccgccgcag cagcaccgcg acttccgccg ccgccctggt gcggagagtg 17100 
taccgcagcg ggcgcgagcc tctgaccctg ccgcgcgcgc gctaccaccc gagcatcgcc 17160 
atttaactac cgcctcctac ttgcagatat ggccctcaca tgccgcctcc gcgtccccat 17220 
tacgggctac cgaggaagaa agccgcgccg tagaaggctg acggggaacg ggctgcgtcg 17280 
ccatcaccac cggcggcggc gcgccatcag caagcggttg gggggaggct tcctgcccgc 17340 
gctgatgccc atcatcgccg cggcgatcgg ggcgatcccc ggcatagctt ccgtggcggt 17400 
gcaggcctct cagcgccact gagacacagc ttggaaaatt tgtaataaaa aatggactga 17460 
cgctcctggt cctgtgatgt gtgtttttag atggaagaca tcaatttttc gtccctggca 17520 
ccgcgacacg gcacgcggcc gtttatgggc acctggagcg acatcggcaa cagccaactg 17580 
aacgggggcg ccttcaattg gagcagtctc tggagcgggc ttaagaattt cgggtccacg 17640 
ctcaaaacct atggcaacaa ggcgtggaac agcagcacag ggcaggcgct gagggaaaag 17700 
ctgaaagagc agaacttcca gcagaaggtg gtcgatggcc tggcctcggg catcaacggg 17760 
gtggtggacc tggccaacca ggccgtgcag aaacagatca acagccgcct ggacgcggtc 17620 
ccgcccgcgg ggtccgtgga gatgccccag gtggaggagg agctgcctcc cctggacaag 17880 
cgcggcgaca agcgaccgcg tcccgacgcg gaggagacgc tgctgacgca cacggacgag 17940 
ccgcccccgt acgaggaggc ggtgaaactg ggtctgccca ccacgcggcc cgtggcgcct 18000 
ctggccaccg gggtgctgaa acccagcagc agcagcagcc agcccgcgac cctggacttg 18060 
cctccgcctg cttcccgccc ctccacagtg gctaagcccc tgccgccggt ggccgtcgcg 18120 
tcgcgcgccc cccgaggccg cccccaggcg aactggcaga gcactctgaa cagcatcgtg 18180 
ggtctgggag tgcagagtgt gaagcgccgc cgctgctatt aaaagacact gtagcgctta 18240 
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acttgcttgt ctgtgtgtat atgtatgtcc gccgaccaga aggaggagga agaggcgcgt 18300 

cgccgagttg caagatggcc accccatcga tgctgcccca gtgggcgtac atgcacatcg 18360 

ccggacagga cgcttcggag tacctgagtc cgggtctggt gcagttcgcc cgcgccacag 18420 

acacctactt cagtctgggg aacaagttta ggaaccccac ggtggcgccc acgcacgatg 18480 

tgaccaccga ccgcagccag cggctgacgc tgcgcttcgt gcccgtggac cgcgaggaca 18540 

acacctactc gtacaaagtg cgctacacgc tggccgtggg cgacaaccgc gtgctggaca 18600 

tggccagcac ctactttgac atccgcggcg tgctggatcg gggccctagc ttcaaaccct 18660 

actccggcac cgcttacaac agcctggctc ccaagggagc gcccaacact tgccagtgga 18720 

catataaagc tgatggtgat actggtacag aaaaaaccta tacatatgga aatgcgcctg 18780 

tgcaaggcat tagtattaca aaagatggta ttcaacttgg aactgacact gatgatcagc 18840 

ccattcatgc agataaaact tatcaaccag agcctcaagt gggtgatgct gaatggcatg 18900 

acatcactgg tactgatgaa aaatatggag gcagagctct caagcctgac accaaaatga 18960 

agccctgcta tggttctttt gccaagccta ccaataaaga aggaggtcag gcaaatgtga 19020 

aaaccgaaac aggcggtacc aaagaatatg acattgacat ggcattcttc gataatcgaa 19080 

gtgcagctgc ggctggcctg gccccagaaa ttgttttgta tactgagaat gtggatctgg 19140 

aaactccaga tactcatatt gtatacaagg cgggcacaga tgacagcagc tcttctatca 19200 

atttgggtca gcagtccatg cccaacagac ccaactacat tggctttaga gacaacttta 19260 

tcgggctcat gtactacaac agcactggca acatgggcgt gctggctggt caggcctccc 19320 

agctgaatgc tgtggtggac ttgcaggaca gaaacactga actgtcctac cagctcttgc 19380 

ttgactctct gggcgacaga accaggtatt tcagtatgtg gaatcaggcg gtggacagct 19440 

atgaccccga tgtgcgcatt attgaaaatc acggtgtgga ggatgaactc cctaactatt 19500 

gcttccccct ggatgctgtg ggtagaactg atacttacca gggaattaag gccaatggtg 19560 

ctgatcaaac cacctggacc aaagatgata ctgttaatga tgctaatgaa ttgggcaagg 19620 

gcaatccttt cgccatggag atcaacatcc aggccaacct gtggcggaac ttcctctacg 19680 

cgaacgtggc gctgtacctg cccgactcct acaagtacac gccggccaac atcacgctgc 1974 0 

cgaccaacac caacacctac gattacatga acggccgcgt ggtggcgccc tcgctggtgg 19800 

acgcctacat caacatcggg gcgcgctggt cgctggaccc catggacaac gtcaacccct 19860 

tcaaccacca ccgcaacgcg ggcctgcgct accgctccat gctcctgggc aacgggcgct 19920 
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acgtgccctt ccacatccag gtgccccaaa agttcttcgc catcaagagc ctcctgctcc 19980 

tgcccgggtc ctacacctac gagtggaact tccgcaagga cgtcaacatg atcctgcaga 2004 0 

gctccctcgg caacgacctg cgcacggacg gggcctccat cgccttcacc agcatcaacc 20100 

tctacgccac cttcttcccc atggcgcaca acaccgcctc cacgctcgag gccatgctgc 20160 

gcaacgacac caacgaccag tccttcaacg actacctctc ggcggccaac atgctctacc 20220 

ccatcccggc caacgccacc aacgtgccca tctccatccc ctcgcgcaac tgggccgcct 20280 

tccgcggatg gtccttcacg cgcctcaaga cccgcgagac gccctcgctc ggctccgggt 20340 

tcgaccccta cttcgtctac tcgggctcca tcccctacct cgacggcacc ttctacctca 20400 

accacacctt caagaaggtc tccatcacct tcgactcctc cgtcagctgg cccggcaacg 204 60 

accgcctcct gacgcccaac gagttcgaaa tcaagcgcac cgtcgacgga gaggggtaca 20520 

acgtggccca gtgcaacatg accaaggact ggttcctggt ccagatgctg gcccactaca 20580 

acatcggcta ccagggcttc tacgtgcccg agggctacaa ggaccgcatg tactccttct 20640 

tccgcaactt ccagcccatg agccgccagg tcgtggacga ggtcaactac aaggactacc 20700 

aggccgtcac cctggcctac cagcacaaca actcgggctt cgtcggctac ctcgcgccca 20760 

ccatgcgcca gggacagccc taccccgcca actaccccta cccgctcatc ggcaagagcg 20820 

ccgtcgccag cgtcacccag aaaaagttcc tctgcgaccg ggtcatgtgg cgcatcccct. 20880 

tctccagcaa cttcatgtcc atgggcgcgc tcaccgacct cggccagaac atgctctacg 20940 

ccaactccgc ccacgcgcta gacatgaatt tcgaagtcga ccccatggat gagtccaccc 21000 

ttctctatgt tgtcttcgaa gtcttcgacg tcgtccgagt gcaccagccc caccgcggcg 21060 

tcatcgaggc cgtctacctg cgcacgccct tctcggccgg caacgccacc acctaagccc 21120 

cgctcttgct tcttgcaaga tgacggcctg tgcgggctcc ggcgagcagg agctcagggc 21180 

catcctccgc gacctgggct gcgggccctg cttcctgggc accttcgaca agcgcttccc 21240 

gggattcatg gccccgcaca agctggcctg cgccatcgtc aacacggccg gccgcgagac 21300 

cgggggcgag cactggctgg ccttcgcctg gaacccgcgc tcccacacct gctacctctt 21360 

cgaccccttc gggttctcgg acgagcgcct caagcagatc taccagttcg agtacgaggg 21420 

cctgctgcgc cgcagcgccc tggccaccga ggaccgctgc gtcaccctgg aaaagtccac 21480 

ccagaccgtg cagggtccgc gctcggccgc ctgcgggctc ttctgctgca tgttcctgca 21540 

cgccttcgtg cactggcccg accgccccat ggacaagaac cccaccatga acttgctgac 21600 
gggggtgccc aacggcatgc tccagtcgcc ccaggtggaa cccaccctgc gccgcaacca 21660 
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ggaggcgctc taccgcttcc tcaacgccca ctccgcctac tttcgctccc accgcgcgcg 21720 

catcgagaag gccaccgcct tcgaccgcat gaatcaagac atgtaaaccg tgtgtgtatg 21780 

tgaatgcttt attcataata aacagcacat gtttatgcca ccttttctga ggctctgact 21840 

ttatttagaa atcgaagggg ttctgccggc tctcggcgtg ccccgcgggc agggatacgt 21900 

tgcggaactg gtacttgggc agccacttga actcggggat cagcagcttc ggcacgggga 21960 

ggtcggggaa cgagtcgctc cacagcttgc gcgtgagttg cagggcgccc agcaggtcgg 22020 

gcgcggagat cttgaaatcg cagttgggac ccgcgttctg cgcgcgggag ttgcggtaca 22080 

cggggttgca gcactggaac accatcaggg ccgggtgctt cacgctcgcc agcaccgtcg 22140 

cgtcggtgat gccctccacg tccagatcct cggcgttggc catcccgaag ggggtcatct 22200 

tgcaggtctg ccgccccatg ctgggcacgc agccgggctt gtggttgcaa tcgcagtgca 22260 

gggggatcag catcatctgg gcctgctcgg agctcatgcc cgggtacatg gccttcatga 22320 

aagcctccag ctggcggaag gcctgctgcg ccttgccgcc ctcggtgaag aagaccccgc 22380 

aggacttgct agagaactgg ttggtggcgc agccggcgtc gtgcacgcag cagcgcgcgt 22440 

cgttgttggc cagctgcacc acgctgcgcc cccagcggtt ctgggtgatc ttggcccggt 22500 

cggggttctc cttcagcgcg cgctgcccgt tctcgctcgc cacatccatc tcgatcgtgt 22560 

gctccttctg gatcatcacg gtcccgtgca ggcatcgcag cttgccctcg gcctcggtgc 22620 

acccgtgcag ccacagcgcg cagccggtgc actcccagtt cttgtgggcg atctgggagt 22680 

gcgagtgcac gaagccctgc aggaagcggc ccatcatcgt ggtcagggtc ttgttgctgg 22740 

tgaaggtcag cgggatgccg cggtgctcct cgttcacata caggtggcag atgcggcggt 22800 

acacctcgcc ctgctcgggc atcagctgga aggcggactt caggtcgctc tccacgcggt 22860 

accggtccat cagcagcgtc atgacttcca tgcccttctc ccaggccgag acgatcggca 22920 

ggctcagggg gttcttcacc gccgttgtca tcttagtcgc cgccgctgag gtcagggggt 22980 

cgttctcgtc cagggtctca aacactcgct tgccgtcctt ctcggtgatg cgcacggggg 23040 

gaaagctgaa gcccacggcc gccagctcct cctcggcctg cctttcgtcc tcgctgtcct 23100 

ggctgacgtc ttgcaaaggc acatgcttgg tcttgcgggg tttctttttg ggcggcagag 23160 

gcggcggcgg agacgtgctg ggcgagcgcg agttctcgct caccacgact atttcttctt 23220 

cttggccgtc gtccgagacc acgcggcggt aggcatgcct cttctggggc agaggcggag 23280 

gcgacgggct ctcgcggttc ggcgggcggc tggcagagcc ccttccgcgt tcgggggtgc 23340 
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gctcctggcg gcgctgctct gactgacttc ctccgcggcc ggccattgtg ttctcctagg 23400 
gagcaacaag catggagact cagccatcgt cgccaacatc gccatctgcc cccgccgccg 23460 
ccgacgagaa ccagcagcag aatgaaagct taaccgcccc gccgcccagc cccacctccg 23520 
acgccgccgc ggccccagac atgcaagaga tggaggaatc catcgagatt gacctgggct 23580 
acgtgacgcc cgcggagcac gaggaggagc tggcagcgcg cttttcagcc ccggaagaga 23640 
accaccaaga gcagccagag caggaagcag agagcgagca gcagcaggct gggctcgagc 23700 
atggcgacta cctgagcggg gcagaggacg tgctcatcaa gcatctggcc cgccaatgca 23760 
tcatcgtcaa ggacgcgctg ctcgaccgcg ccgaggtgcc cctcagcgtg gcggagctca 23820 
gccgcgccta cgagcgcaac ctcttctcgc cgcgcgtgcc ccccaagcgc cagcccaacg 23880 
gcacctgcga gcccaacccg cgcctcaact tctacccggt cttcgcggtg cccgaggccc 23940 
tggccaccta ccacctcttt ttcaagaacc aaaggatccc cgtctcctgc cgcgccaacc 24000 
gcacccgcgc cgacgccctg ctcaacctgg gtcccggcgc ccgcctacct gatatcgcct 24060 
ccttggaaga ggttcccaag atcttcgagg gtctgggcag cgacgagact cgggccgcga 24120 
acgctctgca aggaagcgga gaggagcatg agcaccacag cgccctggtg gagttggaag 24180 
gcgacaacgc gcgcctggcg gtgctcaagc gcacggtcga gctgacccac ttcgcctacc . 24240 
cggcgctcaa cctgcccccc aaggtcatga gcgccgtcat ggaccaggtg ctcatcaagc 24 300 
gcgcctcgcc cctctcggat gaggacatgc aggaccccga gagctcggac gagggcaagc 24360 
ccgtggtcag cgacgagcag ctggcgcgct ggctgggagc gagtagcacc ccccagagct 24420 
tggaagagcg gcgcaagctc atgatggccg tggtcctggt gaccgtggag ctggagtgtc 24480 
tgcgccgctt cttcgccgac gcagagaccc tgcgcaaggt cgaggagaac ctgcactacc 24540 
tcttcaggca cgggtttgtg cgccaggcct gcaagatctc caacgtggag ctgaccaacc 24600 
tggtctccta catgggcatc ctgcacgaga accgcctggg gcagaacgtg ctgcacacca 24 660 
ccctgcgcgg ggaggcccgc cgcgactaca tccgcgactg cgtctacctg tacctctgcc 24720 
acacctggca gacgggcatg ggcgtgtggc agcagtgcct ggaggagcag aacctgaaag 24780 
agctctgcaa gctcctgcag aagaacctga aggccctgtg gaccgggttc gacgagcgca 24840 
ccaccgcctc ggacctggcc gacctcatct tccccgagcg cctgcggctg acgctgcgca 24 900 
acggactgcc cgactttatg agtcaaagca tgttgcaaaa ctttcgctct ttcatcctcg 24960 
aacgctccgg gatcctgccc gccacctgct ccgcgctgcc ctcggacttc gtgccgctga 25020 
ccttccgcga gtgccccccg ccgctctgga gccactgcta cctgctgcgc ctggccaact 25080 
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acctggccta ccactcggac gtgatcgagg acgtcagcgg cgagggtctg ctcgagtgcc 25140 

actgccgctg caacctctgc acgccgcacc gctccctggc ctgoaacccc cagctgctga 25200 

gcgagaccca gatcatcggc accttcgagt tgcaaggccc cggcgagggc aaggggggtc 25260 

tgaaactcac cccggggctg tggacctcgg cctacttgcg caagttcgtg cccgaggact 25320 

accatccctt cgagatcagg ttctacgagg accaatccca gccgcccaag gccgaactgt 25380 

cggcctgcgt catcacccag ggggccatcc tggcccaatt gcaagccatc cagaaatccc 25440 

gccaagaatt tctgctgaaa aagggccacg gggtctacct ggacccccag accggagagg 25500 

agctcaaccc cagcttcccc caggatgccc cgaggaagca gcaagaagct gaaagtggag 25560 

ccgccgccgc cggaggattt ggaggaagac tgggagagca gtcaggcaga ggaggaggag 25620 

acggaagact gggacagcac tcaggcagag gaggacagcc tgcaagacag tctggaagac 25680 

gaggtggagg aggaggcaga ggaagaagca gccgccgcca gaccgtcgtc ctcggcggag 25740 

aaagcaagca gcacggatac catctccgct ccgggtcggg gtcgcggcga ccgggcccac 25800 

agtaggtggg acgagaccgg gcgcttcccg aaccccacca cccagaccgg taagaaggag 25860 

cggcagggat acaagtcctg gcgggggcac aaaaacgcca tcgtctcctg cttgcaagcc 25920 

tgcgggggca acatctcctt cacccgccgc Cacctgctct tccaccgcgg ggtgaacttc .25980 

ccccgcaaca tcttgcatta ctaccgtcac ctccacagcc cctactactg tttccaagaa 26040 

gaggcagaaa cccagcagca gcagaaaacc agcggcagca gcagctagaa aatccacagc 26100 

ggcggcaggt ggactgagga tcgcagcgaa cgagccggcg cagacccggg agctgaggaa 26160 

ccggatcttt cccaccctct atgccatctt ccagcagagt cgggggcagg agcaggaact 26220 

gaaagtcaag aaccgttctc tgcgctcgct cacccgcagt tgtctgtatc acaagagcga 26280 

agaccaactt cagcgcactc tcgaggacgc cgaggctctc ttcaacaagt actgcgcgct 26340 

cactcttaaa gagtagcccg cgcccgccca cacacggaaa aaggcgggaa ttacgtcacc 26400 

acctgcgccc ttcgcccgac catcatcatg agcaaagaga ttcccacgcc ttacatgtgg 26460 

agctaccagc cccagatggg cctggccgcc ggcgccgccc aggactactc cacccgcatg 26S20 

aactggctca gcgccgggcc cgcgatgatc tcacgggtga atgacatccg cgcccgccga 26580 

aaccagatac tcctagaaca gtcagcgatc accgccacgc cccgccatca ccttaatccg 26640 

cgtaattggc ccgccgccct ggtgtaccag gaaattcccc agcccacgac cgtactactt 26700 

ccgcgagacg cccaggccga agtccagctg actaactcag gtgtccagct ggccggcggc 26760 
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gccgccctgt gtcgtcaccg ccccgctcag ggtataaagc ggctggtgat ccgaggcaga 26820 

ggcacacagc tcaacgacga ggtggtgagc tctccgctgg gtctgcgacc tgacggagtc 26880 

ttccaactcg ccggatcggg gagatcttcc ttcacgcctc gtcaggccgt cctgactttg 26940 

gagagttcgt cctcgeagcc ccgctcgggt ggcatcggca ctctccagtt cgtggaggag .27000 

ttcactccct cggtctactt caaccccttc tccggctccc ccggccacta cccggacgag 27060 

ttcatcccga acttcgacgc catcagcgag tcggtggacg gctacgattg aatgtcccat 27120 

ggcggcgcag ctgacctago tcggcttcga cacctggacc actgccgccg cttccgctgc 27180 

ttcgctcggg atctcgccga gtttgcctac tttgagctgc ccgaggagca ccctcagggc 27240 

ccggcccacg gagtgcggat catcgtcgaa gggggcctcg actcccacct gcttcggatc 27300 

ttcagccagc gaccgatcct ggtcgagcgc gagcaaggac agacccttct gaccctgtac 27360 

tgcatctgca accaccccgg cctgcatgaa agtctttgtt gtctgctgtg tactgagtat 27420 

aataaaagct gagatcagcg actactccgg actcgattgt ggcgttcctg ctatcaaccg 27480 

gtccctgttc ttcaccggga acgagaccga gctccagctt cagtgtaagc cccacaagaa 2754 0 

gtacctcacc tggctgttco agggctcccc gatcgccgtt gtcaaccact gcgacaacga 27600 

cggagtcctg ctgagcggcc ccgccaacct tactttttcc acccgcagaa gcaagctcca 276.60 

gctcttccaa cccttcctcc ccgggaccta tcagtgcgtc tcgggaccct gccatcacat 27720 

cttccacctg atcecgaata ccacagcgcc gctccccgct actaacaacc aaactaccca 27780 

ccatcgccac cgtcgcgacc tttctgaatc taacactacc acccacaccg gaggtgagct 27840 

ccgaggtcga ccaacctctg ggatttacta cggcccctgg gaggtggtgg ggttaatagc 27900 

gctaggccta gttgtgggtg ggcttttggc tctctgctac ctatacctcc cttgctgttc 27960 

gtacttagtg gtgctgtgtt gctggtttaa gaaatgggga agatcaccct agtgagctgc 28020 

ggtgcgctgg tggcggtggt ggtgttttcg attgtgggac tgggcggcgc ggctgtagtg 28080 

aaggagaagg ccgatccctg cttgcatttc aatcccgaca attgccagot gagttttcag 28140 

cccgatggca atcggtgcgc ggtgctgatc aagcgcggat gggaatgcga gaacgtgaga 28200 

atcgagtaca ataacaagac tcggaacaat actctcgcgt. ccgcgtggca gcccggggac 28260 

cccgagtggt acaccgtctc tgtccccggt gctgacggct ccccgcgcac cgtgaacaat 28320 

actttcattt ttgcgcacat gtgcgacacg gtcatgtgga tgagcaagca gtacgatatg 28380 

tggcccccca cgaaggagaa catcgtggtc ttctccatcg cttacagcgc gtgcacggcg 28440 

ctaatcaccg ctatcgtgtg cctgagcatt cacatgctca tcgctattcg ccccagaaat 28500 
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aatgccgaaa aagagaaaca gccataacac gttttttcac acaccttttt cagaccatgg 26560 

cctctgttaa atttttgctt ttatttgcca gtctcattac tgttataagt aatgagaaac 28620 

tcactattta cattggcact aaccacactt tagacggaat tccaaaatcc tcatggtatt 28680 

gctattttga tcaagatcca gacttaacta tagaactgtg tggtaacaag ggaaaaaata 28740 

caagcattca tttaattaac tttaattgcg gagacaattt gaaattaatt aatatcacta 28800 

aagagtatgg aggtatgtat tactatgttg cagaaaataa caacatgcag ttttatgaag 28860 

ttactgtaac taatcccacc acacctagaa caacaacaac caccaccaca aaaactacac 28920 

ctgttaccac tatgcagctc actaccaata acatttttgc catgcgtcaa atggtcaaca 28980 

atagcactca acccacccca cccagtgagg aaattcccaa atccatgatt ggcattattg 29040 

ttgctgtagt ggtgtgcatg ttgatcatcg ccttgtgcat ggtgtactat gccttctgct 29100 

acagaaagca cagactgaac gacaagctgg aacacttact aagtgttgaa ttttaatttt 29160 

ttagaaccat gaagatccta ggccttttaa ttttttctat cattacctct gctctatgca 29220 

attctgacaa tgaggacgtt actgtcgttg tcggaaccaa ttatacactg aaaggtccag 29280 

cgaagggtat gctttcgtgg tattgctggt ttggaactga cgagcaacag acagagctct 29340 

gcaatgctca aaaaggcaaa acctcaaatt ctaaaatctc taattatcaa tgcaatggca .29400 

ctgacttagt actgctcaat gtcacgaaag catatgctgg cagctacacc tgccctggag 29460 

atgatactga gaacatgatt ttttacaaag tggaagtggt tgatcccact actccacctc 29520 

cacccaccac aactactcac accacacaca cagaacaaac cacagcagag gaggcagcaa 2 9580 

agttagcctt gcaggtccaa gacagttcat ttgttggcat tacccctaca cctgatcagc 29640 

ggtgtccggg gctgctcgtc agcggcattg tcggtgtgct ttcgggatta gcagtcataa 29700 

tcatctgcat gttcattttt gcttgctgct atagaaggct ttaccgacaa aaatcagacc 29760 

cactgctgaa cctctatgtt taattttttc cagagccatg aaggcagtta gcactctagt 29820 

tttttgttct ttgattggca ctgtttttag tgttagcttt ttgaaacaaa tcaatgttac 29880 

tgagggggaa aatgtgacac tggtaggcgt agagggtgct caaaatacca cctggacaaa 2994 0 

attccatcta gatgggtgga aagaaatttg cacctggaat gtcagtactt atacatgtga 30000 

aggagttaat cttaccattg tcaatgtcag ccaaattcaa aagggttgga ttaaagggca 30060 

atctgttagt gttagcaata gtgggtacta tacccagcat actcttatct atgacattat 30120 

agttatacca ctgcctacac ctagcccacc tagcactacc acacagacaa cccacactac 30180 
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acaaacaacc acatacagta catcaaatca gcctaccacc actacaacag cagaggttgc 30240 
cagctcgtct ggggtccgag tggcattttt gatgttggcc ccatctagca gtcccactgc 30300 
tagtaccaat gagcagacta ctgaattttt gtccactgtc gagagccaca ccacagctac 30360 
ctcgagtgcc ttctctagca ccgccaatct atcctcgctt tcctctacac caatcagtcc 3 0420 
cgctactact cctacccccg ctattctccc cactcccctg aagcaaacag acggcgacat 30460 
gcaatggcag atcaccctgc tcattgtgat cgggttggtc atcctggccg tgttgctcta 30540 
ctacatcttc tgccgccgca ttcccaacgc gcaccgcaag ccggcctaca agcccatcgt 30600 
tgtcgggcag ccggagccgc ttcaggtgga agggggtcta aggaatcttc tcttctcttt 30660 
tacagtatgg tgattgaatt atgattccta gacaaatctt gatcactatt cttatctgcc 30720 
tcctccaagt ctgtgccacc ctcgctctgg tggccaacgc cagtccagac tgtattgggc 30780 
ccttcgcctc ctacgtgctc tttgccttca tcacctgcat ctgctgctgt agcatagtct 30840 
gcctgcttat caccttcttc cagttcattg actggatctt tgtgcgcatc gcctacctgc 30900 
gccaccaccc ccagtaccgc gaccagcgag tggcgcggct gctcaggatc ctctgataag 30960 
catgcgggct ctgctacttc tcgcgcttct gctgttagtg ctcccccgtc ccgtcgaccc 31020 
ccggaccccc acccagtccc ccgaggaggt ccgcaaatgc aaattccaag aaccctggaa 31080 
attcctcaaa tgctaccgcc aaaaatcaga catgcatccc agctggatca tgatcattgg 31140 
gatcgtgaac attctggcct gcaccctcat ctcctttgtg atttacccct gctttgactt 31200 
tggttggaac tcgccagagg cgctctatct cccgcctgaa cctgacacac caccacagca 31260 
acctcaggca cacgcactac caccaccacc acagcctagg ccacaataca tgcccatatt 31320 
agactatgag gccgagccac agcgacccat gctccccgct attagttact tcaatctaac 31380 
cggcggagat gactgaccca ctggccaaca acaacgtcaa cgaccttctc ctggacatgg 31440 
acggccgcgc ctcggagcag cgactcgccc aacttcgcat tcgccagcag caggagagag 31500 
ccgtcaagga gctgcaggac ggcatagcca tccaccagtg caagaaaggc atcttctgcc 31560 
tggtgaaaca ggccaagatc tcctacgagg tcacccagac cgaccatcgc ctctcctacg 31620 
agctcctgca gcagcgccag aagttcacct gcctggtcgg agtcaacccc atcgtcatca 31680 
cccagcagtc gggcgatacc aaggggtgca tccactgctc ctgcgactcc cccgactgcg 31740 
tccacactct gatcaagacc ctctgcggcc tccgcgacct cctccccatg aactaatcac 31800 
ccccttatcc agtgaaataa agatcatatt gatgatttga gtttaataaa aataaagaat 31860 
cacttacttg aaatctgata ccaggtctct gtccatgttt tctgccaaca ccacttcact 31920 
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gagcgagcca gtctcgggtc ggtcagggag atgaaaccct ccgggcactc ccgcatctgc 33660 

acctcacagc tcaacagctg aggattgtcc tcggtggtcg ggatcacggt tatctggaag 33720 

aagcagaaga gcggcggtgg gaatcatagt ccgcgaacgg gatcggccgg tggtgtcgca 33780 

tcaggccccg cagcagtcgc tgtcgccgcc gctccgtcaa.gctgctgctc agggggtccg 33840 

ggtccaggga ctccctcagc atgatgccca cggccctcag catcagtcgt ctggtgcggc 33900 

gggcgcagca gcgcatgcgg atctcgctca ggtcgctgca gtacgtgcaa cacaggacca 33 960 

ccaggttgtt caacagtcca tagttcaaca cgctccagcc gaaactcatc gcgggaagga 34020 

tgctacccac gtggccgtcg taccagatcc tcaggtaaat caagtggcgc cccctccaga 34080 

acacgctgcc catgtacatg atctccttgg gcatgtggcg gttcaccacc tcccggtacc 34140 

acatcaccct ctggttgaac atgcagcccc ggatgatcct gcggaaccac agggccagca 34200 

ccgccccgcc cgccatgcag cgaagagacc ccgggtcccg acaatggcaa tggaggaccc 34260 

accgctcgta cccgtggatc atctgggagc tgaacaagtc tatgttggca cagcacaggc 34320 

atatgctcat gcatctcttc agcactctca gctcctcggg ggtcaaaacc atatcccagg 34380 

gcacggggaa ctcttgcagg acagcgaacc ccgcagaaca gggcaatcct cgcacataac 34440 

ttacattgtg catggacagg gtatcgcaat caggcagcac cgggtgatcc tccaccagag 34500 

aagcgcgggt ctcggtctcc tcacagcgtg gtaagggggc cggccgatac gggtgatggc 34560 

gggacgcggc tgatcgtgtt cgcgaccgtg ttatgatgca gttgctttcg gacattttcg 34620 

tacttgctgt agcagaacct ggtccgggcg ctgcacaccg atcgccggcg gcggtcccgg. 34680 

cgcttggaac gctcggtgtt gaagttgtaa aacagccact ctctcagacc gtgcagcaga 3474 0 

tctagggcct caggagtgat gaagatccca tcatgcctga tggctctaat cacatcgacc 34800 

accgtggaat gggccagacc cagccagatg atgcaatttt gttgggtttc ggtgacggcg 34860 

ggggagggaa gaacaggaag aaccatgatt aacttttaat ccaaacggtc tcggagcact 34920 

tcaaaatgaa gatcgcggag atggcacctc tcgcccccgc tgtgttggtg gaaaataaca 34980 

gccaggtcaa aggtgatacg gttctcgaga tgttccacgg tggcttccag caaagcctcc 35040 

acgcgcacat ccagaaacaa gacaatagcg aaagcgggag ggttctctaa ttcctcaatc 35100 

atcatgttac actcctgcac catccccaga taattttcat ttttccagcc ttgaatgatt 35160 

cgaactagtt cctgaggtaa atccaagcca gccatgataa agagctcgcg cagagcgccc 35220 

tccaccggca ttcttaagca caccctcata attccaagat attctgctcc tggttcacct 3528 0 

gcagcagatt gacaagcgga atatcaaaat ctctgccgcg atccctaagc tcctccctca 3534 0 
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gcaataactg taagtactct ttcatatcct ctccgaaatt tttagccata ggaccaccag 35400 

gaataagatt agggcaagcc acagtacaga taaaccgaag tcctccccag tgagcattgc 35460 

caaatgcaag actgctataa gcatgctggc tagacccggt gatatcttcc agataactgg 35520 

acagaaaatc. gcccaggcaa tttttaagaa aatcaacaaa agaaaaatcc tccaggtgca 35580 

cgtttagagc ctcgggaaca acgatggagt aaatgcaagc ggtgcgttcc agcatggtta 3S64 0 

gttagctgat ctgtagaaaa aaacaaaaat gaacattaaa ccatgctagc ctggcgaaca 35700 

ggtgggtaaa tcgttctctc cagcaccagg caggccacgg ggtctccggc acgaccctcg 3 5760 

taaaaattgt cgctatgatt gaaaaccatc acagagagac gttcccggtg gccggcgtga 35820 

atgattcgac aagatgaata cacccccgga acattggcgt ccgcgagtga aaaaaagcgc 35880 

ccaaggaagc aataaggcac tacaatgctc agtctcaagt ccagcaaagc gatgccatgc 35940 

ggatgaagca caaaattctc aggtgcgtac aaaatgtaat tactcccctc ctgcacaggc 36000 

agcaaagccc ccgatccctc caggtacaca tacaaagcct cagcgtccat agcttaccga 36060 

gcagcagcac acaacaggcg caagagtcag agaaaggctg agctctaacc tgtccacccg 36120 

ctctctgctc aatatatagc ccagatctac actgacgtaa aggccaaagt ctaaaaatac 36180 

ccgccaaata atcacacacg cccagcacac gcccagaaac cggtgacaca ctcaaaaaaa 36240 

tacgcgcact tcctcaaacg cccaaactgc cgtcatttcc gggttcccac gctacgtcat 36300 

caaaattcga ctttcaaatt ccgtcgaccg ttaaaaacgt cgcccgcccc gcccctaacg 36360 

gtcgccgctc ccgcagccaa tcaccgcccc gcatccccaa attcaaatac ctcatttgca 36420 

tattaacgcg caccaaaagt ttgaggtata ttattgatga tg 36462 

<210> 2 
<211> 36604 
<212> DNA 

<213> chimpanzee adenovirus serotype Pan 6 
<400> 2 ; 

catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60 
atttggggag ggaggaaggt gattggctgc gggagcggcg accgttaggg gcggggcggg 120 
tgacgttttg atgacgtggc tatgaggcgg agccggtttg caagttctcg tgggaaaagt 180 
gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240 
ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300 
aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360 
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ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420 

ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggc gtcagctgat cgccagggta 480 

tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540 

cctccgcgcc gcgagtcaga. tctacacttt gaaagatgag gcacctgaga gacctgcccg 600 

gtaatgtttt cctggctact gggaacgaga ttctggaatt ggtggtggac gccatgatgg 660 

gtgacgaccc tccagagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720 

• atctggaggt ggatgtgccc gagagcgacc ctaacgagga ggcggtgaat gatttgttta 780 

gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840 

cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900 

aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960 

aggaggcgat tcgagctgcg gtgaaccagg gagtgaaaac tgcgggcgag agctttagcc 1020 

tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080 

ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 114 0 

acagtaagtg tgattaactt tagttgggaa ggcagagggt gactgggtgc tgactggttt 1200 

atttatgtat atgttttttt atgtgtaggt cccgtctctg acgtagatga gacccccact 1260 

tcagagtgca tttcatcacc cccagaaatt ggcgaggaac cgcccgaaga tattattcat 1320 

agaccagttg cagtgagagt caccgggcgg agagcagctg tggagagttt ggatgacttg 1380 

ctacagggtg gggatgaacc tttggacttg tgtacccgga aacgccccag gcactaagtg 1440 

ccacacatgt gtgtttactt aaggtgatgt cagtatttat agggtgtgga gtgcaataaa 1500 

atccgtgttg actttaagtg cgtgttttat gactcagggg tggggactgt gggtatataa 1560 

gcaggtgcag acctgtgtgg tcagttcaga gcaggactca tggagatctg gactgtcttg 1620 

gaagactttc accagactag acagttgcta gagaactcat cggagggagt ctcttacctg 1680 

tggagattct gcttcggtgg gcctctagct aagctagtct atagggccaa acaggattat 1740 

aaggaacaat ttgaggatat tttgagagag tgtcctggta tttttgactc tctcaacttg 1800 

ggccatcagt ctcactttaa ccagagtatt ctgagagccc ttgacttttc tactcctggc 1860 

agaactaccg ccgcggtagc cttttttgcc tttattcttg acaaatggag tcaagaaacc 1920 

catttcagca gggattaccg tctggactgc ttagcagtag ctttgtggag aacatggagg 1980 

tgccagcgcc tgaatgcaat ctccggctac ttgccagtac agccggtaga cacgctgagg 2040 
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atcc C9 a 9tc tccagtcacc ccaggaacac caacgccgcc agcagccgca gcaggagcag 2100 

cagcaagagg aggaccgaga agagaacccg agagccggtc tggaccctcc ggtggcggag 21 60 

gaggaggagt agctgacttg tttcccgagc tgcgccgggt gctg act agg tcttcoagtg 2220 

9acgggagag ggggattaag cgggagaggc atgaggagac tagccacaga -actgaactga 2280 

ctgtcagtct gatgagccgc aggcgcccag aatcgg t g C g gtggcatgag g t gcag«:cgc 2340 

aggggataga tgaggtctcg gtgatgcatg agaaatattc cctagaacaa gtcaagactt 2400 

9ttgg tt gga gcccgaggat gattgggagg tagccatcag gaattatgcc aagctggctc 2 460 

tgaagccaga caagaagtac aagattacca aactgattaa tatcagaaat tcctgctaca 2520 

tttcagggaa tggggccgag gtggagatca gtacccagga gagggtggcc ttcagatgtt 2580 

gtatgatgaa tatgtacccg gggg t gg tgg gcatggaggg agtcaccttt atgaacacga 2640 
ggttcagggg tgatgggtat aat ggggtgg tctttatggc caacaccaag ctgacagtgc 

acggatgctc cttctttggc ttcaataaca tgtgcatcga ggcctggggc agtgtttcag 2760 

tgaggggatg cagcttttca gccaactgga tggggg tC g C gggcagaacc aagagcaagg 2820 

tgtcagtgaa gaaatgcctg ttcgagaggt gccacctggg ggtg atgagc gagggcgaag 2880 

ccaaagtcaa acactgcgc.c . Cctaccgaga cgggctgctt tgtgctgatc aagggcaatg 2940 

cccaagtcaa gcataacatg atctgtgggg cctcggatga gcgcggctac cagatgctga 3000 

cctgcgccgg tgggaacagc catatgctgg ccaccgtgca tgtggcctcg cacccccgca 3060 

agacatggcc cgagttcgag cacaacgtca tgacccgctg caatgtgcac ctgggctccc 3120 
gccgaggcat gttcatgccc Caccagtgca acatgcaatt tgtgaaggtg ct 9Ct g gagc 
ccgatgccat gtccagagtg agcctgacgg g gg tg ttt ga catgaatgtg gagctgtgga 
aaattctgag atatgatgaa tccaagacca ggtgccgggc ctgcgaatgc ggaggcaagc 

acgccaggct tcagcccgtg tg tgtggagg t g aC g 9agga cctgcgaccc gatc.tttgg 3360 

tgttgtcctg caacgggacg gagttcggct ccagcgggga agaat ctgac tagagtgagt 34 20 

agtgtttggg gc t ggg tgtg agpctgcatg aggggcagaa tgactaaaat ctgtggtttt 3480 

ctgtgtgttg cagcagcatg agcggaagcg cctcctttga gggaggggta ttcagccctt 3540 

atctgacggg gcgtctcccc tcctgggcgg gagtgcgtca gaatgtgatg. ggatccacgg 3600 

tggacggccg gcccgtgcag cccgcgaact cttcaaccct gacctacgcg accctgagct 3660 

cctcgtccgt ggacgcagct gccgccgcag ctgctgcttc cgccgccagc gccgtgcgcg 37 2 0 
gaatggccct gggcgccggc tactacagct ctctggtggc caactcgagt tccaccaata 



3180 
3240 
3300 



3780 
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atcccgccag cctgaacgag gagaagctgc tgctgctgat ggcccagctc gaggccctga 
cccagcgcct gggcgagctg acccagcagg tggctcagct gcaggcggag acgcgggccg 
cggttgccac ggtgaaaa.c aaataaaaaa tgaatcaata aataaacgga gacggttgtt 
gattttaaca ca 9 agtc tt g aatctttatt tga ttttt cg cgcgcggtag gccctggacc 
accggtctcg atcattgagc acccggtgga tcttttccag gacccggtag aggtgggctt 
99atgttgag gtacatgggc atgagcccgt cccgggggtg gaggtagctc cattgcaggg 
cctcgtgctc ggggatggtg ttgtaaatca cccagtcata gcaggggcgc agggcgtggt 
gctgcacgat gtccttgagg aggagactga tggccacggg cagccccttg g.tgtaggtgt 
tgacgaacct gttgagctgg gagggatgca tgcgggggga gatgagatgc atcttggcct 
9 gatcttgag attggcgatg ttcccgccca gatcccgccg ggggttcatg ttgtgcagga 
ccaccagcac ggtgtatccg gtgcacttgg ggaatttgtc atgcaacttg gaagggaagg 
cgtgaaagaa tttggagacg cccttgtgac cgcccaggtt ttccatgcac tcatccatga 
tgatggcgat gggcccgtgg gcggcggcct gggcaaagac gtttcggggg tcggacacat 
cgtagttgtg g tcct ggg tg agctcgtcat aggccatttt aatgaatctg gggcggaggg 
tgcccgactg ggggacgaag gtgccctcga tcccgggggc gtagttgccc tcgcagatct 
gcatctccca ggccttgagc tcggaggggg ggatcatgtc cacctgcggg gcgatgaaaa 
aaacggtttc cggggcgggg gagatgagct gggccgaaag caggttccgg agcagctggg 
acttgccgca accggtgggg ccgtagatga ccccgatgac cggctgcagg tggtagttga 
gggagagaca gctgccgtcc tcgcggagga ggggggccac ctcgttcatc atctcgcgca 
catgcatgtt ctcgcgcacg agttccgcca ggaggcgctc gccccccagc gagaggagct 
cttgcagcga ggcgaagttt ttcagcggct tgagtccgtc ggccatgggc attttggaga 
gggtctgttg caagagttcc agacggtccc agagctcggt gatgtgctct agggcatctc 
gatccagcag acctcctcgt ttcgcgggtt ggggcgactg cgggagtagg gcaccaggcg 
atgggcgtcc agcgaggcca gggtccggtc cttccagggc cgcagggtcc gcgtcagcgt 
ggtctccgtc acggtgaagg ggtgcgcgcc gggctgggcg cttgcgaggg tgcgcttcag 
gctcatccgg ctggtcgaga accgctcccg gtcggcgccc tgcgcgtcgg ccaggtagca 
attgagcatg agtccgtagt tgagcgcctc ggccgcgtgg cccttggcgc ggagcttacc 
tttggaagtg tgtccgcaga cgggacagag gagggacttg agggcgtaga gcttgggggc 5460 



3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
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gaggaagacg gactcggggg cgtaggcgtc cgcgccgoag ccggcgcaga cggtctcgca 
ctccacgagc caggtgaggt cggggcggtt ggggtcaaaa acgaggtttc ctccgtgctt 
tttgatgcgt ttcttacctc tggtctccat gagctcgtgt ccccgctggg tgacaaagag 
gctgtccgtg tccccgtaga ccgactttat gggccggtcc tcgagcgggg tgccgcggtc 
ctcgtcgtag aggaaccccg cccactccga gacgaaggcc cgggtccagg ccagcacgaa 
ggaggccacg tgggaggggt agcggtcgtt gtccaccagc gggtccacct tctccagggt 
atgcaagcac atgtccccct cgtccacatc caggaaggtg attggcttgt aagtgtaggc 
cacgtgaccg ggggtcccgg ccgggggggt ataaaagggg gcgggcccct gctcgtcctc 
actgtcttcc ggatcgctgt ccaggagcgc cagctgttgg ggtaggtatt ccctctcgaa 
ggcgggcatg acctcggcac tca ggttgtc agtttctaga aacgaggagg atttgatatt 
gacggtgccg ttggagacgc ctttcatgag cccctcgtcc atttggtcag aaaagacgat 
ctttttgttg tcgagcttgg tggcgaagga gccgtagagg gcgttggaga gcagcttggc 
gatggagcgc atggtctggt tcttttcctt gtcggcgcgc tccttggcgg cgatgttgag 
ctgcacgtac.tcgcgcgcca cgcacttcca ttcggggaag acggtggtga gctcgtcggg 
cacgattctg acccgccagc cgcggttgtg cagggtgatg aggtccacgc tggtggccac 
ctcgccgcgc aggggctcgt tggtccagca gaggcgcccg cccttgcgcgagcagaaggg 
gggcagcggg tccagcatga gctcgtcggg ggggtcggcg tccacggtga agatgccggg 6480 
caggagctcg gggtcgaagt agctgatgca ggtgcccaga ttgtccagcg ccgcttgcca 6540 
gtcgcgcacg gccagcgcgc gctcgtaggg gctgaggggc gtgccccagg gcatggggtg 6600 
cgtgagcgcg gaggcgtaca tgccgcagat gtcgtagacg tagaggggct cctcgaggac 6660 
gccgatgtag gtggggtagc agcgcccccc gcggatgctg gcgcgcacgt agtcgtacag 
ctcgtgcgag ggcgcgagga gccccgtgcc gaggttggag cgttgcggct tttcggcgcg 
gtagacgatc tggcggaaga tggcgtggga gttggaggag atggtgggcc tttggaagat 
gttgaagtgg gcgtggggca ggccgaccga gt.ccctgatg aagtgggcgt aggagtcctg 
cagcttggcg acgagctcgg cggtgacgag gacgtccagg gcgcagtagt cgagggtctc 6960 
ttggatgatg tcatacttga gctggccctt ctgcttccac agctcgcggt tgagaaggaa 
ctcttcgcgg tccttccagt actcttcgag ggggaacccg tcctgatcgg cacggtaaga 
gcccaccatg tagaactggt tgacggcctt gtaggcgcag cagcccttct ccacggggag 
ggcgtaagct tgcgcggcct tgcgcaggga ggtgtgggtg agggcgaagg tgtcgcgcac 



5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 



6720 
6780 
6840 
6900 



7020 
7080 
7140 
7200 
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catgaccttg aggaactqqt acttaaaat-r a^aatmf^ 


cagccgccct 


gctcccagag 


7260 


ttggaagtcc Qtgcgcttct tataaacaaa att-aanr**** 


gcgaaagtaa 


catcgttgaa 


7320 


gaggatcttg cccqcqcaqa acataaaot-t* omani-ftflf« 


eggaaagget 


ggggcacctc 


7380 


ggcccggttg ttgatcracct aaaraarnan narnafoh«« 


tegaagcegt 


tgatgttgtg 


7440 


CCCqacgatq taqaqttcca caaatcaran nmnpr^h 

s 9 j ""3 w 5 i, ^v\,B v>yaacLy^yy ycygcCCCCg 


acgtggggca 


gcttcttgag 


7500 


>»^yy <-yyyy£.cgcc gagcccgtgc 


tgctcgaggg 


cccagtcggc 


7560 


a^y^yyyyy ^wyyuycuya. gyaaggaagc CCagagatCC 


aeggecaggg 


eggtctgeaa 


7620 


qcqqtcccqq tactaacaoa art-nt-f _„ ^ »_ 
sg-vv^g ua^Lya^yya aCCy ULyyCC CSCggcCa t 1 


ttttcggggg 


tgacgcagta 


7680 


qaaqqtqcqq QQQ t CO rrnh nrrarrpnnf n /i/fa/tv^n*- 

3 -s^^a^y yyy^y^v.31. yccagcggtc ccacttgagc 


tggagggega 


ggtcgtgggc 


7740 


qaqCtCqaCq aQCaOCaoah r , r t r*r~nriar%zar* t-ht- M!1 t- M ^» 

3 3 w>. 3 «v.*j oyvyy^yyyt, ccccggagag tcccatgacc 


agcatgaagg 


ggacgagctg 


7800 


c t tqccqaao oaccccafrr annhnhsn/tf ^ 

-3 -J yattcwtcc a ggcgcaggt ttccacatcg 


ta 99tgagga 


agagecttte 


7860 


qq tqcqaqqa tQCGaoccaa harrm-man^a «- _ 

3 J '-yv.yayv.v.yd L 3999 aa 9»» CCggatCCCC 


tgccaccagt 


tggaggaatg 


7920 


3 3 1 33 aa 3 *- ayaaatyCCy aCQqCQCQCC 


gagcactcgt 


gcttgtgttt 


7980 


atacaaqcqt ccocaot'arh mr>aar<nrfn _ „ »_ ^ _ 
^ -3 *~ ^^-y^ayLyn, cycaacgcxg cacgggatgc 


aegtgetgea 


cgagctgtac 


8040 


. v*-yyv.yoyya acctcagcgg gcagtggagc 


getggegget 


geatctegtg 


:. eioo 


ctqtactacq tcfctaaceah rbarat- nnr-r* a ^ -,»-«^-* 
3 »«^i<wyyv.Lav. ^yycycggcc accgcccgcc 


tcgatggtgg 


teatgetgae 


8160 


gagcccgcgc gggaggcaqq tccaaacctc aap^^^aarn 


ggteggagag 


cgaggacgag 


8220 


ggcgcgcagg ccggaqctqt ccaaaateer aaaar-o^t-o^ 


ggagtcaggt 


cagtgggcag 


8280 


cggcggcgcg cqqttqactt acaaoaarhh t- h rrarmrt^ 


cgcgggaggt 


ccagatggta 


8340 


3 w w w. «*^yyv*yv.tyi. tyy uyy CCdC g LCCacggC t 


tgcagggtgc 


cgtgcccctg 


8400 


333vvjk,Lav,t <n.tgtgcccc gcttcctctt gggegctget 


tecatgtegg 


teagaagegg 


8460 


*-yy^yeigyac gcgcgccggg eggcagggge ggctegggge 


ccggaggcag 


gggeggcagg 


8520 


yyk.quyi.Lyy cgccgcgcgc gggcaggttc tggtactgcg 


cccggagaag 


actggcgtga 


8580 


gcgacgacgc gaeggttgae gtcctggatc tgacgcctct 


gggtgaaggc 


cacgggaccc 


8640 


gtgagtttga acctgaaaga gagttcgaca gaatcaatct 


eggtategtt 


gacggcggcc 


8700 


tgccgcagga tctcttgcac gtcgcccgag ttgtcctggt 


aggegatetc 


ggtcatgaac 


8760 


tgetcgatet cctcctcctg aaggtctccg cggccggcgc 


getcgaeggt 


ggecgegagg 


8820 


tcgttggaga tgcggcccat gagctgegag aaggcgttca 


tgccggcctc 


gttccagacg 


8880 
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cggctgtaga 


ccacggctcc gtcggggtcg 


cgcgcgcgca 


tgaccacctg 


ggcgaggttg 


8940 


agctcgacgt 


ggcgcgtgaa gaccgcgtag 


ttgcagaggc 


gctggtagag 


gtagttgagc 


9000 


gtggtggcga 


tgtgctcggt gacgaagaag 


tacatgatcc 


agcggcggag 


cggcatctcg 


9060 


ctgacgtcgc 


ccagggcttc caagcgttcc 


atggcctcgt 


agaagtccac 


ggcgaagttg 


9120 


aaaaactggg 


agttgcgcgc cgagacggtc 


aactcctcct 


ccagaagacg 


gatgagctcg 


91B0 


gcgatggtgg 


cgcgcacctc gcgctcgaag 


gccccggggg 


gctcctcttc 


catctcctcc 


9240 


tcttcctcct 


ccactaacat ctcttctact 


tcctcctcag 


gaggcggtgg 


c gggggaggg 


9300 


gccctgcgtc 


gccggcggcg cacgggcaga 


cggtcgatga 


agcgctcgat 


ggtctccccg 


9360 


cgccggcgac 


gcatggtctc ggtgacggcg 


cgcccgtcct 


cgcggggccg 


cagcatgaag 


9420 


acgccgccgc 


gcatctccag gtggccgccg 


ggggggtctc 


cgttgggcag 


ggagagggcg 


9480 


ctgacgatgc 


atcttatcaa ttgacccgta 


gggactccgc 


gcaaggacct 


gagcgtctcg 


9540 


agatccacgg 


gatccgaaaa ccgctgaacg 


aaggcttcga 


gccagtcgca 


gtcgcaaggt 


9600 


aggctgagcc 


cggtttcttg ttcttcgggt 


atttggtcgg 


gaggcgggcg 


ggcgatgctg 


9660 


ctggtgatga 


agttgaagta ggcggtcctg 


agacggcgga 


tggtggcgag 


gagcaccagg 


9720 


tccttgggcc 


cggcttgctg. gatgcgcaga 


cggtcggcca 


tgccccaggc 


gtggtcctga 


9780 


cacctggcga 


ggtccttgta gtagtcctgc 


atgagccgct 


ccacgggcac 


ctcctcctcg 


9840 


cccgcgcggc 


cgtgcatgcg cgtgagcccg 


aacccgcgct 


gcggctggac 


gagcgccagg 


9900 


tcggcgacga 


cgcgctcggt gaggatggcc 


tgctggatct 


gggtgagggt 


ggtctggaag 


9960 


tcgtcgaagt 


cgacgaagcg gtggtaggct 


ccggtgttga 


tggtgtagga 


gcagttggcc 


10020 


atgacggacc 


agttgacggt ctggtggccg 


ggtcgcacga 


gctcgtggta 


cttgaggcgc 


10080 


gagtaggcgc 


gcgtgtcgaa gatgtagtcg 


ttgcaggcgc 


gcacgaggta 


ctggtatccg 


10140 


acgaggaagt 


gcggcggcgg ctggcggtag 


agcggccatc 


gctcggtggc 


gggggcgccg 


10200 


ggcgcgaggt 


cctcgagcat gaggcggtgg 


tagccgtaga 


tgtacctgga 


catccaggtg 


10260 


atgccggcgg 


cggtggtgga ggcgcgcggg 


aactcgcgga 


cgcggttcca 


gatgttgcgc 


10320 


agcggcagga 


agtagttcat ggtggccgcg 


gtctggcccg 


tgaggcgcgc 


gcagtcgtgg 


10380 


atgctctaga 


catacgggca aaaacgaaag 


cggtcagcgg 


ctcgactccg 


tggcctggag 


10440 


gctaagcgaa 


c 9ggttgggc tgcgcgtgta 


ccccggttcg 


aatctcgaat 


caggctggag 


10500 


ccgcagctaa 


cgtggtactg gcactcccgt 


ctcgacccaa 


gcctgctaac 


gaaacctcca 


10560 


ggatacggag 


gcgggtcgtt ttttggcctt 


ggtcgctggt 


catgaaaaac 


tagtaagcgc 


10620 
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aaaaaacacrr 


cgcccgcgat ggctcgctgc 


cgtagtctgg 


agaaagaatc 


gccagggttg 


10680 


v«y *• ^y <-yy ty 


tgccccggtt cgagcctcag 


cgctcggcgc 


eggceggatt 


ccgcggctaa 


10740 


cataaacata 
^sya ^-y ^-y 


gctgccccgt cgtttccaag 


accccttagc 


cagccgactt 


ctccagttac 


10800 


CfOr? nrnanrr* 
yyay^y ciycc 


cctctctttt tttcttgtgt 


ttttgecaga 


tgcatcccgt 


actgeggcag 


10860 




accctccacc acaaccgccc 


ctaccgcagc 


agcagcaaca 


gccggcgctt 


10920 




cccagcagca gccagccact 


accgcggcgg 


ccgccgtgag 


cggagccggc 


10980 




acctggcctt ggaagagggc 


gaggggctgg 


cgcggctggg 


ggegtegteg 


11040 


ccggagcggc 


acccgcgcgt gcagatgaaa 


agggacgetc 


gegaggecta 


cgtgcccaag 


11100 


cdgaacccgc 


tcagagacag gageggegag 


gageccgagg 


agatgegege 


ctcccgcttc 


11160 


cacgcggggc 


gggagctgcg gcgcggcctg 


gaccgaaagc 


gggtgctgag 


ggacgaggat 


11220 


c Lcgaggcgg 


acgagctgac ggggatcagc 


cccgcgcgcg 


cgcacgtggc 


cgcggccaac 


11280 


c egg ecaegg 


egtacgagea gaccgtgaag 


gaggagagca 


acttccaaaa 


atccttcaac 


11340 


aaccacg tgc 


geaegctgat cgcgcgcgag 


gaggtgaccc 


tgggectgat 


gcacctgtgg 


11400 


gacccgctgg 


aggecategt gcagaacccc 


acgagcaagc 


cgctgacggc 


gcagctgttt 


11460 


ctygcggcgc 


agcacagtcg ggacaacgag 


acgttcaggg 


aggegctget 


gaatatcacc 


11520 


Cf A a f f» a rtn 
y ciy \^^L,y ciyy 


gccgctggct cctggacctg 


gtgaacattt 


tgcagagcat 


cgtggtgcag 


11580 


yagegeggge 


tgccgctgtc cgagaagctg 


gcggccatca 


acttcteggt 


gctgagtctg 


11640 


yycuagcacc 


aegctaggaa gatctacaag 


accccgtacg 


tgeccataga 


caaggaggtg 


11700 


aagatcgacg 


ggttttacat gcgcatgacc 


ctgaaagtgc 


tgaccctgag 


cgacgatctg 


11760 


ggggtgtacc 


gcaacgacag gatgcaccgc 


gcggtgagcg 


ccagccgccg 


gcgcgagctg 


11820 


agegaccagg 


agetgatgea cagcctgcag 


cgggccctga 


ccggggccgg 


gaccgagggg 


11680 


gagagctact 


ttgacatggg cgcggacctg 


cgctggcagc 


ccagccgccg 


ggccttggaa 


11940 


gctgccggcg 


gttcccccta cgtggaggag 


gtggacgatg 


aggaggagga 


gggegagtae 


12000 


ctggaagact 


gatggegega ccgtattttt 


getagatgea 


gcaacagcca 


ccgccgccgc 


12060 


ctcctgatcc 


cgegatgegg gcggcgctgc 


agagccagcc 


gtceggcatt 


aactcctcgg 


12120 


acgattggac 


ccaggccatg caacgcatca 


tggegctgae 


gacccgcaat 


cccgaagcct 


12180 


ttagacagca 


gcctcaggcc aaccggctct 


cggccatcct 


ggaggccgtg 


gtgccctcgc 


12240 


gctcgaaccc 


cacgcacgag aaggtgctgg 


ccatcgtgaa 


cgcgctggtg 


gagaacaagg 


12300 
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ccatcc g cgg tgacgaggcc gggctggtgt acaacgcgct gctggagcgc gtggcccgct 12360 

acaacagcac caacgtgcag acgaacctgg accgcatggt gaccgacgtg cgcgaggcgg 12420 

tgtcgcagcg cgagcggttc caccgcgagt cgaacctggg ctccatggtg gcgctgaacg 12480 

ccttcctgag cacgcagccc gccaacgtgc cccggggcca ggaggactac accaacttca 12540 

tcagcgcgct gcggctgatg gtggccgagg tgccccagag cgaggtgtac cagtcggggc 12600 

cggactactt cttccagacc agtcgccagg gcttgcagac cgtgaacctg agccaggctt 12660 

tcaagaactt gcagggactg tggggcgtgc aggccccggt cggggaccgc gcgacggtgt 12720 

cgagcctgct gacgccgaac tcgcgcctgc tgctgctgct ggtggcgccc ttcacggaca 12780 

gcggcagcgt gagccgcgac tcgtacctgg gctacctgct taacctgtac cgcgaggcca 1284 0 

tcggacaggc gcacgtggac gagcagacct accaggagat cacccacgtg agccgcgcgc 12900 

tgggccagga ggacccgggc aacctggagg ccaccctgaa cttcctgctg accaaccggt 12960 

cgcagaagat cccgccccag tacgcgctga gcaccgagga ggagcgcatc ctgcgctacg 13020 

tgcagcagag cgtggggctg ttcctgatgc aggagggggc cacgcccagc gcggcgctcg 13080 

acatgaccgc gcgcaacatg gagcccagca tgtacgcccg caaccgcccg ttcatcaata 13140 

agctgatgga ctacttgcat cgggcggccg ccatgaactc ggactacttt accaacgcca 13200 

tcttgaaccc gcactggctc ccgccgcccg ggttctacac gggcgagtac gacatgcccg . 13260 

accccaacga cgggttcctg tgggacgacg tggacagcag cgtgttctcg ccgcgtccag 13320 

gaaccaatgc cgtgtggaag aaagagggcg gggaccggcg gccgtcctcg gcgctgtccg 13380 

gtcgcgcggg tgctgccgcg gcggtgcccg aggccgccag ccccttcccg. agcctgccct 13440 

tttcgctgaa cagcgtgcgc agcagcgagc tgggtcggct gacgcgaccg cgcctgctgg 13500 

gcgaggagga gtacctgaac gactccttgt tgaggcccga gcgcgagaag aacttcccca 13 560 

ataacgggat agagagcctg gtggacaaga tgagccgctg gaagacgtac gcgcacgagc 13620 

acagggacga gccccgagct agcagcgcag gcacccgtag acgccagcgg cacgacaggc 13680 

agcggggact ggtgtgggac gatgaggatt ccgccgacga cagcagcgtg ttggac.ttgg 13740 

gtgggagtgg tggtaacccg ttcgctcacc tgcgcccccg tatcgggcgc ctgatgtaag 13800 

aatctgaaaa aataaaagac ggtactcacc aaggccatgg cgaccagcgt gcgttcttct 13860 

ctgttgtttg tagtagtatg atgaggcgcg tgtacccgga gggtcctcct ccctcgtacg 13920 

agagcgtgat gcagcaggcg gtggcggcgg cgatgcagcc cccgctggag gcgccttacg 13980 

tgcccccgcg gtacctggcg cctacggagg ggcggaacag cattcgttac tcggagctgg 14040 
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cacccttgta cgataccacc cggttgtacc tggtggacaa caagtcggca gacatcgcct 14100 

cgctgaacta ccagaacgac cacagcaact tcctgaccac cgtggtgcag aacaacgatt 14160 

tcacccccac ggaggccagc acccagacca tcaactttga cgagcgctcg cggtggggcg 14220 

gccagctgaa aaccatcatg cacaccaaca tgcccaacgt gaacgagttc atgtacagca 14280 

acaagttcaa ggcgcgggtg atggtctcgc gcaagacccc caacggggtg gatgatgatt 14340 

atgatggtag tcaggacgag ctgacctacg agtgggtgga gtttgagctg cccgagggca 14400 

acttctcggt gaccatgacc atcgatctga tgaacaacgc catcatcgac aactacttgg 14460 

cggtggggcg gcagaacggg gtgctggaga gcgacatcgg cgtgaagttc gacacgcgca 14520 

acttccggct gggctgggac cccgtgaccg agctggtgat gccgggcgtg tacaccaacg 14 580 

aggccttcca ccccgacatc gtcctgctgc ccggctgcgg cgtggacttc accgagagcc 14640 

gcctcagcaa cctgctgggc atccgcaagc ggcagccctt ccaggagggc ttccagatcc 14700 

tgtacgagga cctggagggg ggcaacatcc ccgcgctctt ggatgtcgaa gcctacgaga 14760 

aaagcaagga ggatagcacc gccgcggcga ccgcagccgt ggccaccgcc tctaccgagg 14 820 

tgcggggcga taattttgct agcgctgcgg cagcggccga ggcggctgaa accgaaagta 14880 

agatagtcat ccagccggtg gagaaggaca gcaaggacag gagctacaac gtgctcgcgg 14940 

acaagaaaaa caccgcctac cgcagctggt acctggccta caactacggc gaccccgaga 15000 

agggcgtgcg ctcctggacg ctgctcacca cctcggacgt cacctgcggc gtggagcaag 15060 

tctactggtc gctgcccgac atgatgcaag acccggtcac cttccgctcc acgcgtcaag 15120 

ttagcaacta cccggtggtg ggcgccgagc tcctgcccgt ctactccaag agcttcttca 15180 

acgagcaggc cgtctactcg cagcagctgc gcgccttcac ctcgctcacg cacgtcttca 15240 

accgcttccc cgagaaccag atcctcgtcc gcccgcccgc gcccaccatt accaccgtca 15300 

gtgaaaacgt tcctgctctc acagatcacg ggaccctgcc gctgcgcagc agtatccggg 15360 

gagtccagcg cgtgaccgtc actgacgcca gacgccgcac ctgcccctac gtctacaagg 15420 

ccctgggcgt agtcgcgccg Cgcgtcctct cgagccgcac cttctaaaaa atgtccattc 15480 

tcatctcgcc cagtaataac accggttggg gcctgcgcgc gcccagcaag atgtacggag 15540 

gcgctcgcca acgctccacg caacaccccg tgcgcgtgcg cgggcacttc cgcgctccct 15600 

ggggcgccct caagggccgc gtgcgctcgc gcaccaccgt cgacgacgtg atcgaccagg 15660 

tggtggccga cgcgcgcaac tacacgcccg ccgccgcgcc cgtctccacc gtggacgccg 15720 
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tcatcgacag 


cgtggtggcc 


gacgcgcgcc 


ggtacgcccg caccaagagc 


cggcqqcqqc 

3 3 *3 3 ^ 3 3 


15780 


gcatcgcccg 


gcggcaccgg 


agcacccccg 


ccatgcgcgc ggcgcgagcc 


ttgctgcgca 


15840 


gggccaggcg 


cacgggacgc 


agggccatgc 


tca 99gcggc cagacgcgcg 


QCCtccaacA 

3 ^ 33 " 


15900 

JL37UV 


gcagcagcgc 


cggcaggacc 


cgcagacgcg 


cggccacggc ggcggcggcg 

^ WW WW W W W 


QCCatcacra 


1 COCA 


gcatgtcccg 


cccgcggcgc 


ggcaacgtgt 


actgggtgcg cgacgccgcc 


accaotat'ar 




gcgtgcccgt 


gcgcacccgc 


ccccctcgca 


cttgaagatg ctgacttcgc 


Qatottaaha 


1DU O v 


tgtcccagcg 


gcgaggagga 


tgtccaagcg 


caaatacaag gaagaqatqc 






cgcgcctgag 


atctacggcc 


ccgcggcggc 


ggtgaaggag gaaagaaagc 


cccocaaacf* 


16200 


gaagcgggtc 


aaaaaggaca 


aaaaggagga 


ggaagatgac ggactqqtaq 


a 3*»i» i»y ^~y ^y 




cgagttcgcc 


ccccggcggc 


gcgtgcagtg 


gcgcgggcgg aaagtgaaac 


cqqtactaca 

3Z3 *~3 *^ *- 3 ^y 


16320 


gcccggcacc 


acggtggtct 


tcacgcccgg 


cgagcgttcc ggctccgcct 




X OJO u 


ctacgacgag 


gtgtacgggg 


acgaggacat 


cctcgagcag gcggtcgagc 


q t C tqqqcqa 


16440 


gtttgcgtac 


ggcaagcgca 


gccgccccgc 


gcccttgaaa gaqgaqqcqq 


tQtccatrrr 


J.U3UV 


gctggaccac 


ggcaacccca 


cgccgagcct 


gaagccggtg accctgcagc 


aqqtgcta.cc 

33 v 3 > * ^ ^ 


16560 


gagcgcggcg 


ccgcgccggg 


gcttcaagcg 


cgagggcggc gaggatctqt 


acccaacnA t" 


JIODZU 


gcagctgatg 


gtgcccaagc 


gccagaagct 


ggaggacqtq ctqqaacaca 


uy dciyy i-yy a 


■ IbboU 


ccccgaggtg 


cagcccgagg 


tcaaggtgcg 


gcccatcaag caggtggccc 


C qqoc c t a era 

v 333^ v u 333 


16740 


cgtgcagacc 


gtggacatca 


agatccccac 


ggagcccatg gaaacgcaga 




X O D \J KJ 


gaagcccagc 


accagcacca 


tggaggtgca 


gacggatccc tggatgccag 


caccaacttc 


16860 


caccagcact 


cgccgaagac 


gcaagtacgg 


cgcggccagc ctqctqatqc 

w w w w ^* w w «3 




i soon 

X03*U 


gctgcatcct 


tccatcatcc 


ccacgccggg 


ctaccgcggc acgcgcttct 


accacaacta 


16980 


caccagcagc 


cgccgccgca 


agaccaccac 


ccgccgccgt cgtcgcagcc 


QCCQcaocaa 


17040 


caccgcgact 


tccgccttgg 


tgcggagagt 


gtatcgcagc gggcgcgagc 


ctctgaccc t 


17100 


gccgcgcgcg 


cgctaccacc 


cgagcatcgc 


catttaacba ccgcctccta 






tggccctcac 


atgccgcctc 


cgcgtcccca 


ttaCQQqcta ccoaaoaaaa 
ww **"333 ww ** »»v»y»yy aaya 


adyccgcgcc 


17220 


gtagaaggct 


gacggggaac 


gggctgcgtc 


gccatcacca ccggcggcgg 


cgcgccatca 


17280 


gcaagcggtt 


999gggaggc 


ttcctgcccg 


cgctgatccc catcatcgcc 


geggegateg 


17340 


gggcgatccc 


cggcatagct 


tccgtggcgg 


tgcaggcctc tcagcgccac 


tgagacacaa 


17400 


aaaagcatgg 


atttgtaata 


aaaaaaaaaa 


tggactgacg ctcctggtcc 


tgtgatgtgt 


17460 
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gtttttagat ggaagacatc aatttttcgt ccctggcacc gcgacacggc acgcggccgt 17520 

ttatgggcac ctggagcgac atcggcaaca gccaactgaa cgggggcgcc ttcaattgga 17580 

gcagtctctg gagcgggctt aagaatttcg ggtccacgct caaaacctat ggcaacaagg 17640 

cgtggaacag cagcacaggg caggcgctga gggaaaagct gaaagaacag aacttccagc 17700 

agaaggtggt tgatggcctg gcctcaggca tcaacggggt ggttgacctg gccaaccagg 17760 

ccgtgcagaa acagatcaac agccgcctgg acgcggtccc gcccgcgggg tccgtggaga 17820 

tgccccaggt ggaggaggag ctgcctcccc tggacaagcg cggcgacaag cgaccgcgtc 17880 

ccgacgcgga ggagacgctg ccgacgcaca cggacgagcc gcccccgtac gaggaggcgg 17940 

tgaaactggg cctgcccacc acgcggcccg tggcgcctct ggccaccgga gtgctgaaac 18000 

ccagcagcag ccagcccgcg accctggact tgcctccgcc tcgcccctcc acagtggcta 18060 

agcccctgcc gccggtggcc gtcgcgtcgc gcgccccccg aggccgcccc caggcgaact 18120 

ggcagagcac tctgaacagc atcgtgggtc tgggagtgca gagtgtgaag cgccgccgct 18180 

gctattaaaa gacactgtag cgcttaactt gcttgtctgt gtgtatatgt atgtccgccg 18240 

accagaagga ggagtgtgaa gaggcgcgtc gccgagttgc aagatggcca ccccatcgat 183 00 

gctgccccag tgggcgtaca tgcacatcgc cggacaggac gcttcggagt acctgagtcc 18360 

gggtctggtg cagttcgccc gcgccacaga cacctacttc agtctgggga acaagtttag 18420 

gaaccccacg gtggcgccca cgcacgatgt gaccaccgac cgcagccagc ggctgacgct 18480 

gcgcttcgtg cccgtggacc gcgaggacaa cacctactcg tacaaagtgc gctacacgct 18540 

ggccgtgggc gacaaccgcg tgctggacat ggccagcacc tactttgaca tccgcggcgt 18600 

gctggaccgg ggccctagct tcaaacccta ctctggcacc gcctacaaca gcctagctcc 18660 

caagggagct cccaattcca gccagtggga gcaagcaaaa acaggcaatg ggggaactat 18720 

ggaaacacac acatatggtg tggccccaat gggcggagag aatattacaa aagatggtct 18780 

tcaaattgga actgacgtta cagcgaatca gaataaacca atttatgccg acaaaacatt 1884 0 

tcaaccagaa ccgcaagtag gagaagaaaa ttggcaagaa actgaaaact tttatggcgg 18900 

tagagctctt aaaaaagaca caaacatgaa accttgctat ggctcctatg ctagacccac 18960 

caatgaaaaa ggaggtcaag ctaaacttaa agttggagat gatggagttc caaccaaaga 19020 

attcgacata gacctggctt tctttgatac tcccggtggc accgtgaacg gtcaagacga 19080 

gtataaagca gacattgtca tgtataccga aaacacgtat ttggaaactc cagacacgca 19140 
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tgtggtatac 


aaaccaggca 


aggatgatgc 


aagttctgaa 


attaacctgg ttcagcagtc 


19200 


tatgcccaac 


agacccaact 


acattgggtt 


cagggacaac 


tttatcggtc ttatgtacta 


19260 


caacagcact 


ggcaatatgg 


gtgtgcttgc 


tggtcaggcc 


tcccagctga atgctgtggt 


19320 


tgatttgcaa 


gacagaaaca 


ccgagctgtc 


ctaccagctc 


ttgcttgact ctttgggtga 


19380 


cagaacccgg 


tatttcagta 


tgtggaacca 


99 c g9tggac 


agttatgacc 


ccgatgtgcg 


19440 


catcatcgaa 


aaccatggtg 


tggaggatga 


attgccaaac 


tattgcttcc 


ccttggacgg 


19500 


ctctggcact 


aacgccgcat 


accaaggtgt 


gaaagtaaaa 


gatggtcaag 


atggtgatgt 


19560 


tgagagtgaa 


tgggaaaatg 


acgatactgt 


tgcagctcga 


aatcaattat 


gtaaaggtaa 


19620 


cattttcgcc 


atggagatta 


atctccaggc 


taacctgtgg 


agaagtttcc 


tctactcgaa 


19680 


cgtggccctg 


tacctgcccg 


actcctacaa 


gtacacgccg 


accaacgtca 


cgctgccgac 


19740 


caacaccaac 


acctacgatt 


acatgaatgg 


cagagtgaca 


cctccctcgc 


tggtagacgc 


19800 


ctacctcaac 


atcggggcgc 


gctggtcgct 


ggaccccatg 


gacaacgtca 


accccttcaa 


19860 


ccaccaccgc 


aacgcgggcc 


tgcgctaccg 


ctccatgctc 


ctgggcaacg 


ggcgctacQt 


19920 


gcccttccac 


atccaggtgc 


cccaaaagtt 


tttcgccatc 


aagagcctcc 


tgctcctgcc 


19980 


cgggtcctac 


acctacgagt 


ggaacttccg 


caaggacgtc 


aacatgatcc 


tgcagagctc 


20040 


cctaggcaac 


gacctgcgca 


cggacggggc 


ctccatcgcc 


ttcaccagca 


tcaacctcta 


20100 


cgccaccttc 


ttccccatgg 


cgcacaacac 


cgcctccacg 


ctcgaggcca 


tgctgcgcaa 


20160 


cgacaccaac 


gaccagtcct 


tcaacgacta 


cctctcggcg 


gccaacatgc 


tctaccccat 


20220 


cccggccaac 


gccaccaacg 


tgcccatctc 


catcccctcg 


cgcaactggg 


ccgccttccg 


20280 


cggatggtcc 


ttcacgcgcc 


tgaagacccg 


cgagacgccc 


tcgctcggct 


ccgggttcga 


20340 


cccctacttc 


gtctactcgg 


gctccatccc 


ctacctagac 


oacacctfrri" 


acc t.caacca 


20400 


caccttcaag 


aaggtctcca 


tcaccttcga 


ctcctccgtc 


agctggcccg 


gcaacgaccg 


20460 


cctcctgacg 


cccaacgagt 


tcgaaatcaa 


gcgcaccgtc 


gacggagagg 


gatacaacgt 


20520 


ggcccagtgc 


aacatgacca 


aggactggtt 


cctggtccag 


atgctggccc 


actacaacat. 


20580 


cggctaccag 


ggcttctacg 


tgcccgaggg 


ctacaaggac 


cgcatgtact 


ccttcttccg 


20640 


caacttccag 


cccatgagcc 


gccaggtcgt 


ggacgaggtc 


aactacaagg 


actaccaggc 


20700 


cgtcaccctg 


gcctaccagc 


acaacaactc 


gggcttcgtc 


ggctacctcg 


cgcccaccat 


20760 


gcgccagggc 


cagccctacc 


ccgccaacta 


cccctacccg 


ctcatcggca 


agagcgccgt 


20820 


cgccagcgtc 


acccagaaaa 


agttcctctg 


cgaccgggtc 


atgtggcgca 


tccccttctc 


20880 
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cagcaacttc a tgtC ca t9g 9 c 9 c 9 ctcac c 9 acc t c 9 go cagaacatgc tctacgccaa 20940 

ctccgcccac gcgctagaca tgaatttcga ag tcg acccc a t gga t g agt ccacccttct 21000 

ctatgttgtc ttcgaagtct tcgacgtcgt ccgag t gcac cagccccacc gcggcgtcat 21060 

cgaagccgtc tacctgcgca cgcccttctc ggccggcaac gccaccacct aagccgetct 2H20 

tgcttcttgc aagatgacgg cgggctccgg cgagcaggag ctcagggcca tcctccgcga 21X80 

cctgggctgc gggccctgct tcctgggcac cttcgacaag cgcttccctg gattcatggc 2X240 

cccgcacaag ctggcctgcg ccatcgtgaa cacggccggc cgcgagaccg ggggcgagca 2X300 

ctggctggcc ttcgcctgga acccgcgctc ccacacatgc tacctcttcg accccttcgg 2X360 

9"c t cggac gagcgcctca agcagatcta ccagttcgag tacgagggcc tgccgcgtcg 2X420 

cagcgccctg gccaccgagg accgctgcgt caccctggaa aagtccaccc agaccgtgca 2X460 

gggtccgcgc tcggccgcct gcgggctctt ctgctgcatg ttcctgcacg ccttcgtgca 2X540 

ctggcccgac cgccccatgg acaagaaccc caccatgaac ttactgacgg gggtgcccaa 2X600 

cggca t gctc cagtcgcccc a 9gtgg aacc caccctgcgc cgcaaccagg aagcgctcta 2X660 

ccgcttcctc aatgoccact cc g cctac Ct tcgctccca, cgcgcgcgca tcgagaaggc 2X720 

caccgccttc gaccgcatga atcaagacat gtaaaaaacc ggtgtgtgta tgtgaatgct 2X780 

ttattcataa taaacagcac atgtttatgc caccttctct gaggctctga ctttatttag 2X840 

aaatcgaagg 9gtt ctgcc 3 gctctcggca tggcccgcgg gcagggatac gttgcggaac 2X900 

tggtacttgg gcagccactt gaactcgggg atcagcagct tgggcacg g g gaggtcgggg 2X960 

aacgagtcgc tccacagctt gcgcgtgagt tgcagggcgc ccagcaggtc gggcgcggag 22020 

atcttgaaat cgcag tt ggg acccgcgttc tgcgcgcgag agttgcggta cacggg gttg 2208 0 

cagcactgga acaccatcag ggccgggtgc ttcacgcttg ccagcaccgc cgcgtcggtg 22140 

atgccctcca cgtccagatc ctcggcgttg gccatcccga agggg gtC a t cttgcaggtc 22200 

tgccgcccca tgctgggcac gcagccgggc ttgtggttgc aatcgcagtg caggggg atc 22260 

agcatcatct gggcctgctc ggagctcatg cccggg t aca tggccttcat gaaagcctcc 22320 

agctggcgga aggcctgctg cgccttgccg ccctcggtga agaagacccc gcaggacttg 22380 

ctagagaact ggttggtggc gcagccggcg tcgtgcacgc agcagcgcgc gtcgttgttg 22440 

9ccagctgca ccacgctgcg cccccagcgg ttctgggtga tcttggcccg gttggggttc 22500 

tccttcagcg cgcgctgccc gttctcgctc gccacatcca tctcgatagt gtgctccttc 22560 
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tggatcatca 


cggtcccgtg 


caggcaccgc 


agcttgccct 


cggcttcggt 


gcagccgtgc 


22620 


agccacagcg 


cgcagccggt 


gcactcccag 


ttcttgtggg 


cgatctggga 


gtgcgagtgc 


22680 


acgaagccct 


gcaggaagcg 


gcccatcatc 


gcggtcaggg 


tcttgttgct 


ggtgaaggtc 


22740 


agcgggatgc 


cgcggtgctc 


ctcgttcaca 


tacaggtggc 


agatgcggcg 


gtacacctcg 


22800 


ccctgctcgg 


gcatcagctg 


gaaggcggac 


ttcaggtcgc 


tctccacgcg 


gtaccggtcc 


22860 


atcagcagcg 


tcatcacttc 


catgcccttc 


tcccaggccg 


aaacgatcgg 


caggctcagg 


22920 


gggttcttca 


ccgccattgt 


catcttagtc 


gccgccgccg 


aggtcagggg 


gtcgttctcg 


22980 


tccagggtct 


caaacactcg 


cttgccgtcc 


ttctcgatga 


tgcgcacggg 


gggaaagctg 


23040 


aagcccacgg 


ccgccagctc 


ctcctcggcc 


tgcctttcgt 


cctcgctgtc 


ctggctgatg 


23100 


tcttgcaaag 


gcacatgctt 


ggtcttgcgg 


ggtttctttt 


tgggcggcag 


aggcggcggc 


23160 


gatgtgctgg 


gagagcgcga 


gttctcgttc 


accacgacta 


tttcttcttc 


ttggccgtcg 


23220 


tccgagacca 


cgcggcggta 


ggcatgcctc 


ttctggggca 


gaggcggagg 


cgacgggctc 


23280 


tcgcggttcg 


gcgggcggct 


ggcagagccc 


cttccgcgtt 


cgggggtgcg 


ctcctggcgg 


23340 


cgctgctctg 


actgacttcc 


tccgcggccg 


gccattgtgt 


tctcctaggg 


agcaacaaca 


23400 


agcatggaga 


ctcagccatc 


gtcgccaaca 


tcgccatctg 


cccccgccgc 


caccgccgac 


23460 


gagaaccagc 


agcagaatga 


aagcttaacc 


gccccgccgc 


ccagccccac 


ctccgaqgcc 


23520 


gcggccccag 


acatgcaaga 


gatggaggaa 


tccatcgaga 


ttgacctggg 


ctacgtgacg 


23580 


cccgcggagc 


acgaggagga 


gctggcagcg 


cgcttttcag 


ccccggaaga 


gaaccaccaa- 


23640 


gagcagccag 


agcaggaagc 


agagaacgag 


cagaaccagg 


ctgggcacga 


gcatggcgac 


23700 


tacctgagcg 


gggcagagga 


cgtgctcatc 


aagcatctgg 


cccgccaatg 


catcatcgtc 


23760 


aaggacgcgc 


tgctcgaccg 


cgccgaggtg 


cccctcagcg 


tggcggagct 


cagccgcgcc 


23820 


tacgagcgca 


acctcttctc 


gccgcgcgtg 


ccccccaagc 


gccagcccaa 


cggcacctgt 


23880 


gagcccaacc 


cgcgcctcaa 


cttctacccg 


gtcttcgcgg 


tgcccgaggc 


cctggccacc 


23940 


taccacctct 


ttttcaagaa 


ccaaaggatc 


cccgtctcct 


gccgcgccaa 


ccgcacccgc 


24000 


gccgacgccc 


tgctcaacct 


gggccccggc 


gcccgcctac 


ctgatatcac 


ctccttggaa 


24060 


gaggttccca 


agatcttcga 


gggtctgggc 


agcgacgaga 


ctc gggccgc gaacgctctg 


24120 


caaggaagcg 


gagaggagca 


tgagcaccac 


agcgccctgg 


tggagttgga 


aggcgacaac 


24180 


gcgcgcctgg 


cggtcctcaa 


gcgcacggtc 


gagctgaccc 


acttcgccta 


cccggcgctc 


24240 


aacctgcccc 


ccaaggtcat 


gagcgccgtc 


atggaccagg 


tgctcatcaa 


gcgcgcctcg 


24300 
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cccctctcgg aggaggagat gcaggacccc gagagttcgg acgagggcaa gcccgtggtc 24360 

agcgacgagc agctggcgcg ctggctggga gcgagtagca ccccccagag cctggaagag 24420 

cggcgcaagc tcatgatggc cgtggtcctg gtgaccgtgg agctggagtg tctgcgccgc 24480 

ttctttgccg acgcggagac cctgcgcaag gtcgaggaga acctgcacta cctcttcagg 24S40 

cacgggttcg tgcgccaggc ctgcaagatc tccaacgtgg agctgaccaa cctggtctcc 24600 

tacatgggca tcctgcacga gaaccgcctg gggcaaaacg tgctgcacac caccctgcgc 24660 

ggggaggccc gccgcgacta catccgcgac tgcgtctacc tgtacctctg ccacacctgg 24720 

cagacgggca tgggcgtgtg gcagcagtgc ctggaggagc agaacctgaa agagctctgc 24780 

aagctcctgc agaagaacct caaggccctg tggaccgggt tcgacgagcg taccaccgcc 24840 

tcggacctgg ccgacctcat cttccccgag cgcctgcggo tgacgctgcg caacgggctg 24900 

cccgacttta tgagccaaag catgttgcaa aactttcgct ctttcatcct cgaacgctcc 24960 

gggatcctgc ccgccacctg ctccgcgctg ccctcggact tcgtgccgct gaccttccgc 25020 

gagtgccccc cgccgctctg gagccactgc tacttgctgc gcctggccaa ctacctggcc 25080 

taccactcgg acgtgatcga ggacgtcagc ggcgagggtc tgctggagtg ccactgccgc 25140 

.tgcaacctct gcacgccgca ccgctccctg gcctgcaacc cccagctgct . gagcgagacc 252.00 

cagatcatcg gcaccttcga gttgcaaggc cccggcgacg gcgagggcaa ggggggtctg 25260 

aaactcaccc cggggctgtg gacctcggcc tacttgcgca agttcgtgcc cgaggactac 2 5320 

catcccttcg agatcaggtt ctacgaggac caatcccagc cgcccaaggc cgagctgtcg 25380 

gcctgcgcca tcacccaggg ggccatcctg gcccaattgc aagccatcca gaaatcccgc 25440 

caagaatttc tgctgaaaaa gggccacggg gtctacttgg acccccagac cggagaggag 25500 

ctcaacccca gcttccccca ggatgccccg aggaagcagc aagaagctga aagtggagct 25560 

gccgccgccg gaggatttgg aggaagactg ggagagcagt caggcagagg aggaggagat 25620 

ggaagactgg gacagcactc aggcagagga ggacagcctg caagacagtc tggaggagga 25680 

agacgaggtg gaggaggcag aggaagaagc agccgccgcc agaccgtcgt cctcggcgga 25740 

gaaagcaagc agcacggata ccatctccgc tccgggtcgg ggtcgcggcg gccgggccca 25800 

cagtaggtgg gacgagaccg ggcgcttccc gaaccccacc acccagaccg gcaagaagga 25860 

9cggcagg ga tacaagtcct ggcgggggca caaaaacgcc atcgtctcct gcttgcaagc 25920 

ctgcgggggc aacatctcct tcacccggcg ctacctgctc ttccaccgcg gggtgaactt 25980 
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cccccgcaac atcttgcatt actaccgcca cctccacagc ccc t a ct act 9tttccaa9a 26040 
agaggcagaa acccagcagc agcagaaaac cagcggcagc agcagctaga aaatccacag 26X00 
cggcggcagg tggactgagg atcgcggcga acgagccggc gcagacccgg gagctgagga 26X60 
accggatctt tcccaccctc tatgccatct tccagcagag t cgg gggcag gagcaggaac 26220 
tgaaagtcaa gaaccgttct ctgcgctcgc tcacccgcag ttgtctgtat cacaagagcg 26280 
aagaccaac t tcagcgcact ctcgaggacg ccgaggctct cttcaacaag tactgcgcgc 26340 
tcactcttaa agagtagccc gcgcccgccc acacacggaa aaaggcggga attacgtcac 26400 
cacctgcgcc cttcgcccga ccatcatgag caaagagatt cccacgcctt acatgtggag 26460 
ctaccagccc cagatgggcc tggccgccgg cgccgcccag gactactcca cccgcatgaa 26520 
ctggctcagt gccgggcccg cgatgatctc acgggtgaat ga c atCC gcg ccc ac cg aaa 26580 
ccagatactc ctagaacagt cagcgatcac cgccacgccc cgccatcacc ttaatccgcg 26640 
taattggccc gccgccctgg t gtaccagga aactccccag cccacgaccg tactacttcc 26700 
3 cgagacgcc caggccgaag tC cagc t gac taac.caggt g tC cag Ct gg ccggcggcgc 26760 
cgccctgtgt cgtcaccgcc ccgctcaggg tataaa9cgg ctggtgatcc gaggca3agg 2 ^ 
cacacagctc aacgacgagg tggtgagctc ttcgctgggt ctgcgacctg acggagtctt 26880 
ccaactcgcc ggatcgggg. g atcttcctt cacgcctcgt caggccgtcc tgacttfcgga ^ 
Sagttcgtcc tcgcagcccc gctcgggcgg catcggc.ct ctccagttcg tggaggagtt 27000 
cactccctcg g tct ac t tca accccttctc cggctccccc ggccactacc cggacgagtt 27060 
catcccgaac ttcgacgcca tcagcgagtc ggt gg acggc taC g attgaa tgtcccatgg 2?120 
tsgcgcagct gacctagctc ggcttcgaca cc t ggaccac tgccgccgct tccgctgctt 27180 
cgctcgggat ctcgccgagt ttgcctactt tgagctgccc gaggagcacc ctcagggccc 27240 
agcccacgga gtgcggatca tcgtcgaagg gggcctcgac tcccacctgc ttcggatctt 27300 
cagccagcga ccgatcctgg tcgagcgcga acaaggacag acccttctta ctttgtactg 27360 
catctgcaac caccccggcc tgcatgaaag tctttgttgt c t gctg t gta ctgagtataa 27420 
taaaagctga gatc.gcgac tactccggac tcgattgtgg tgttcctgct atcaaccggt 27480 
ccctgttctt caccggg aac gagaccgagc tccagctcca g t g taa g ccc cacaa g aa g t 27540 
accccacctg gctgttccag ggctccccga tcgccgttgt caaccactgc gacaacgacg 27600 
Sagtcctgct gagcggccct gccaacctta ctttttccac ccgcagaagc aagctccagc 27660 
tcttccaacc cttcctcccc gggacctatc agtgcgtctc aggaccctgc catcacacct 27720 
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tccacc t3at ccc 9aat acc aca 9 c 9 cc g c tccccgctac taacaaccaa actacccacc 2 7780 

aac 3 ccacc g tcg c 3 acc t c tcctctgaat ctaataccac tacc gg a 9gt gag c t c= g a g 2,840 

gtcgaccaac cte tgggatt tactacggcc cctgggaggt ggtggggtta atagcgctag 2790Q 

9ccta gttgc gggtgggctt ttggttctct gctacctata cctcccttgc tgttcgtact 27g60 

ta gtggtgct 9 t g tt g ct gg tttaagaaat gggg aa g atc accctagtga g c tgcggtgc 28020 

9Ct gg t 93 c g gtgt t gCt tt c 3attgtggg actg33cggc gc33 c t3ta3 t 3aa33 a g a a 28080 

33 cc 3at ccc t g ct t3 catt tc aat ccc aa caaa tg cca 3 ct 3 a 3 t ttt o a 3 ccc 9 at 3g 28140 

caatc ggtg c gcggta ct ga tc aagtgcgg at gggaatgc 9 a g aac gtga gaatcgagta 28200 

caataacaag ac t c 3gaaca atactctcgc gtccg t g tgg ca g ccc gggg acccc 9 a 9tg 28260 

gtacaccgtc tctgtccccg S t 9 ct g ac gg ctccccgcgc acc 3 t 9 aata atactttcat 28320 

ttttgcgcac at gtg c aa ca c 9g tca tgtg gatg a g caa 9 cagtacgata tgtggccccc 28380 

cac 9 a agga9 aacatcgcgg cctcctccat cgcctacagc ctgtgcacgg cgccaatcac 2844Q 

c 3 c tatCgtg tgc ct gagC a ttcacatgcc catcgctatt cgccccagaa ataatgccga 28SO0 

gaaagagaaa ca g cca t aac ac gttttCtc acacaccttg tttttacaga caatgcgtct ^ 

gttaaatttt ttaaaca ttg tgctcagtat tgcttatgcc tctggttatg caaacataca 28620 

aaaaaccctt ta t g taggat ctgatggtac actagagggt acccaatcac aagccaaggfc ^ 

t9Mt " tat ttttata9M — ^ tccagttaaa ct ttgtaagg gtgaatCgcc 28740 

gcgtacacat aaaactccac ttacatttag tt g ca g ca at aataatctta cacttttttc 28800 

aa tt aca aaa caa t atac tg g tactt a t ta Ca g tacaaa c tttcaUcag gacaagaCaa 28 860 

at a ttatact 9 tt aaggtag aaaatcctac cactcctaga actaccacca ccaccactac 28920 

t 9 c a aagccc actg tgaaaa c Ca c aa ctag gaccaccaca actacagaaa ccaccaccag 28980 

c a c a acactt gctgcaacta cacacacaca cactaagcta accttacaga ccactaatga 29040 

tttgatcgcc ctgctgcaaa aggggg ataa cagcaccact tccaatgagg agatacccaa 29100 

atccatgatt ggcattattg ttgctgtagt ggtgtgcatg ttgatcatcg ccttgtgcat 29160 

ggCg tac t a t 9 cc tt ct 3 ct acagaaagca cagactgaac 3 acaa gctgg aacacttact 29220 

aa 3t3 tt 3aa ttttaatttt ttagaacc.t 3 a a3a tccta ggccttttta gtttttctat 29280 

cattacctct gctctttgtg aatcagtgga t.g.gatgtt a c tattaC c a ctggttctaa 29340 

ttatacactg aaagggccac cctcaggtat gctttcgtgg tattgctatt ttggaactga 29400 
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cactgatcaa actgaattat gcaattttca aaaaggcaaa acctcaaact ctaaaatctc 29460 

taattatcaa tgcaatggca ctgatctgat actactcaat gtcacgaaag catatggtgg 29520 

cagttattat tgccctggac aaaacactga agaaatgatt ttttacaaag tggaagtggt 295B0 
tgatcccac.t acaccaccca ccaccacaac tattcatacc acacacacag aacaaacacc. 29640 

agaggcaaca gaagcagagt tggccttcca ggttcacgga gattcctttg ctgtcaatac 29700 

ccctacaccc gatcagcggt gcccggggcc gctagtcagc ggcattgtcg gtgtgctttc 29760 

gggattagca gtcataatca tctgcatgtt catctttgct tgctgctata gaaggcttta 29820 

ccgacaaaaa tcagacccac tgctgaacct ctatgtttaa ttttttccag agccatgaag 29880 

gcagttagcg ctctagtttt ttgttctttg attggcattg tttttaatag taaaattacc 29940 

agagttagct ttattaaaca tgttaatgta actgaaggag ataacatcac actagcaggt 30000 

gtagaaggtg ctcaaaacac cacctggaca aaataccatc taggatggag agatatttgc 30060 

acctggaatg taacttatta ttgcatagga gttaatctta ccattgttaa cgctaaccaa 30120 

tctcagaatg gg ttaatt aa aggacagagc gttagcgtga ccagtgatgg gtactatacc 30160 

cagcatagtt ttaactacaa cattactgtc ataccactgc ctacgcctag cccacctagc 30240 

actaccacac agacaaccac atacagtaca- tcaaatcagc ctaccaccac tacagcagca 30300 

gaggttgcca gctcgtctgg ggtccgagtg gcatttttga tgttggcccc atctagcagt 30360 

cccactgcta gtaccaatga gcagactact gaatttttgt ccactgtcga gagccacacc 30420 

acagctacct ccagtgcctt ctctagcacc gccaatctct cctcgctttc ctctacacca 30480 

atcagccccg ctactactcc tagccccgct cctcttccca ctcccctgaa gcaaacagac 30540 

ggcggcatgc aatggcagat caccctgccc attgtgatcg ggttggtcat cctggccgtg 30600 

ttgctctact acatcttctg ccgccgcatt cccaacgcgc accgcaagcc ggcctacaag 30660 

cccatcgtta tcgggcagcc ggagccgctt caggtggaag ggggtctaag gaatcttctc 30720 

ttctctttta cagtatggtg attgaactat gattcctaga caattcttga tcactattct 30780 

tatctgcctc ctccaagtct gtgccaccct cgctctggtg gccaacgcca gtccagactg 30840 

tattgggccc ttcgcctcct acgtgctctt tgccttcgtc acctgcatct gctgctgtag 30900 

catagtctgc ctgcttatca ccttcttcca gttcattgac tggatctttg tgcgcatcgc 30960 

ccacctgcgc caccaccccc agtaccgcga ccagcgagtg gcgcagctgc tcaggctcct 31020 

ctgataagca tgcgggctct gctacttctc gcgcttctgc tgttagtgct cccccgtccc 31080 

gtcgaccccc ggtcccccac tcagtccccc gaggaggttc gcaaatgcaa attccaagaa 31140 
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ccc tgga aat tcct caaa t9 ct a Ccg ccaa aaa tC a g aca tgC a t ccca g ctgg atca tg 312 00 
atca ttggga tcgt3aacat tctggcctgc accctcatct cctttgtgat ttaccccfcgc 3i26Q 

ttt g ac tttg gttggaactc gcca9aggcg ctctafcctcc cgccfcgaacc ^^^^^ 3i32o 

ccaca gt? a 9 c aacc t ca ggc acacgcacta ccacca.cac a 3 c, t a g9 cc acaa t aca tg 3X380 

cccata ttag ac, atgagg c c 9 a g ccaca g c g acccat gc tcccc.ctat tagttacttc 3X440 

aa t ctaacc 3 g c 3g a gatg a ct 9 acccac t ggcc aa t aac aacgtcaacg accttctcct 3X500 

9 g aca tggac ggccgcgcct cggagcagcg actcgcccaa ctfccgcattc gtcagcagca 3i56Q 

9 3 a gagag cc g tcaa ggag c tgcagg ac gg cat a g ccatc cacca gtg ca a g a 9 a gg cat 3X6.0 

c tt c tgcctg gtga aaca gg ccaagatctc ctac gagg tc accca 9 acc g accatcgcct 3X680 

ctcc t ac 9 a 9 ctcctgca.c ag c g cca 9 aa gtt cacc tg c c tg3tcgg a g tcaaccccat 3X740 

cgtcatcaoc ca g ca gtcgg gcgata ccaa g939tg catc cac tg ctc Ct g c 3 actcccc 3X800 

c g ac tgcgt c cacactc tg a tcaa g accct ct g c 9g cctc c 9 c 9 acc t cc tccccat g aa 3X860 

ctaatcaccc cc t tatcca 9 tg aaataaa 3 a tC ata ttga c g at gattC a aataaaaaaa 3X 920 

ataatcattt g a tttga aat aaa gata caa tca t a ttgat gatttgagtt taacaaaaat 31980 

aaagaatcac ttac tt9 aaa tc tgatacca ggt ctct gt c cat gt tttc t 9 cc aaca cc a - 32 040 

cctcactccc c tcttC cc ag ctcCggtact gcaggccccg gcgggct9ca aacttcctc<= ^ 

acac gctgaa g3ggatgtca aattcctcct gfcccctcaat cttcatfctta tcfctctatca 32i6fl 

9at 3 tcc aaa aa 3 c gcgt cc 9ggtggatga tgacttcgac cccgCctacc cctacgatgc ^ 

a 9 acaac 9 ca cc g acc gtgc cc tt ca t caa cccccccttc gtCt c t tca g at ggattC ca 322 80 

a gag aa 9 ccc ct gggggtgt tgtccctgcg actggcCgac cccgtcacca ccaagaacgg ^ 

g9 aaa t c a cc c t c aagct99 gagagggg9t ggacctcgac tcgtcggg aa aactcatctc 32 400 

caacac g9 cc accaa 9g cc 9 cc.cccctct ca gtatttca aacaacacca tttcccttaa ^ 

aactgctgcc cc CttctaC a acaacaat gg aactttaagc ctcaat g tc t ccacaccatt 32520 

a 3 ca gtattt cccacattta acacttta gg cataagtctt gg aaac ggtc ttcagacttc 3.580 

aaa t aa gtt9 ttgactgtac aactaactca tcctcttaca ttcagctcaa atagcafccac ^ 

agtaaaaaca g acaaa 9ggc tatatattaa c t cca gtgga aaca g a ggac tt ga3gct a a 32 700 

tataagccta aaa agaggac tagcttttga cggCaatgct attgcaacat atattggaaa ^ 

t 99 c tta9ac ta t33at ctt a t 3 ata 9tg a t 93 aaaaaca a 3 accc gtaa ttaccaaaat 3 2820 
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tggagcagga ttaaattttg atg c t aacaa agcaatagct gtcaaactag gcacagg ttt 32880 

aagttttgac tccgctggtg ccttgacagc tggaaacaaa caggatgaca agctaacact 32940 

ttggactacc cctgacccaa gccctaattg tcaa tt ac tt tcagacagag atgccaaatt 33000 

..tactctctgt cttacaaaat. gcggtagtca aatactaggc actgtggcag tggcggctgt 33060 

tactgtagga tcagcactaa atccaattaa tgacacagtc aaaagcgcca tagttttcct 33 12 0 

tagatttgat tccgatggtg tactcatgtc aaactcatca atggtaggtg attactggaa 33180 

ctttagggag ggacagacca ctcaaagtgt agcctataca aatgctgtgg gattcatgcc 33240 

aaatataggt gcatatccaa aaacccaaag taaaacacct aaaaatagca tagtcagtca 33300 

ggtatattta actggagaaa ctactatgcc aatgacacta accataactt tcaatggcac 33360 

tgatgaaaaa gacacaaccc cagttagcac ctactctatg acttttacat ggcagtggac 33420 

tggagactat aaggacaaaa atattacctt tgctaccaac tcattctctt tttcctacat 33480 

cgcccaggaa taatcccacc cagcaagcca accccttttc ccaccacctt tgtctatatg 33540 

gaaactctga aacagaaaaa taaagttcaa gtgttttatt gaatcaacag ttttacagga 33600 

ctcgagcagt tatttttcct ccaccctccc aggacatgga atacaccacc ctctcccccc 33660 

gcacagcctt gaacatctga atgccattgg tgatggacat gcttttggtc tccacgttcc . 33720 

acacagttcc agagcgagcc agtctcggat cggtcaggga gatgaaaccc tccgggcact 33780 

cccgcatctg cacctcacag ctcaacagct gagga tt g tc ctcggtggtc gggatcacgg 33840 

ttatctggaa gaagcagaag agcggcggtg ggaatcatag tccgcgaacg ggatcggccg 33900 

gtggtgtcgc atcaggcccc gcagcagtcg ctgccgccgc cgctccgtca agctgctgct 33960 

cagggggttc gggtccaggg actccctcag catgatgccc acggccctca gcatcagtcg 34020 

tctggtgcgg cgggcgcagc agcgcatgcg aatctcgctc aggtcactgc agtacgtgca 34080 

acacaggacc accaggttgt tcaacagtcc atagttcaac acgctccagc cgaaactcat 34140 

cgcgggaagg atgctaccca cgtggccgtc gtaccagatc ctcaggtaaa tcaagtggcg 34200 

ctccctccag aagacgctgc ccatgtacat gatctccttg ggcatgtggc ggttcaccac 34260 

ctcccggtac cacatcaccc tctggttgaa catgcagccc cggatgatcc tgcggaacca 34320 

cagggccagc accgccccgc ccgccatgca gcgaagagac cccggatccc ggcaatgaca 34380 

atggaggacc caccgctcgt acccgtggat catctgggag ctgaacaagt ctatgttggc 34440 

acagcacagg catatgctca tgcatctctt cagcactctc agctcctcgg gggtcaaaac 34500 

catatcccag ggcacgggga actcttgcag gacagcgaac cccgcagaac agggcaatcc 34560 
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tcgcacataa cttacattgt gcatggacag ggtatcgcaa tcaggcagca ccgggtgatc 34620 

ctccaccaga gaagcgcggg tctcggtctc ctcacagcgt ggtaaggggg ccggccgata 34680 

cgggtgatgg cgggacgcgg ctgatcgtgt tctcgaccgt gtcatgatgc agttgctttc 34740 

ggacattttc gtacttgctg tagcagaacc tggtccgggc gctgcacacc gatcgccggc 34800 

99cggtctcg gcgcttggaa cgctcggtgt taaagttgta aaacagccac tctctcagac 34860 

cgtgcagcag atctagggcc tcaggagtga tgaagatccc atcatgcctg atagctctga 34920 

tcacatcgac caccgtggaa tgggccaggc ccagccagat gatgcaattt tgttgggttt 34980 

cggtgacggc gggggaggga agaacaggaa gaaccatgat taacttttaa tccaaacggt 35040 

ctcggagcac ttcaaaatga aggtcacgga gatggcacct ctcgcccccg ctg C g tt g gt 3 5100 

ggaaaataac agccaggtca aaggtgatac ggttctcgag atgttccacg gtggcttcca 35X60 

gcaaagcctc cacgcgcaca tccagaaaca agacaatagc gaaagcggga gggttctcta 35220 

attcctcaac catcatgtta cactcctgca ccatccccag ataattttca tttttccagc 35280 

cttgaatgat tcgaactagt tcctgaggta aatccaagcc agccatgata aaaagctcgc 35340 

gcagagcacc ctccaccggc attcttaagc acaccctcat aattccaaga tattctgctc 35400 

ctggttcacc tgcagcagat tgacaagcgg aatatcaaaa tctctgccgc gatccctgag- 35460 

ctcctccctc agcaataact gtaagtactc ttt catatcg tctccgaaac ttttagccat 35520 

aggaccccca ggaataagag aagggcaagc cacattacag ataaaccgaa gtccccccca 35580 

gtgagcattg ccaaatgtaa gattgaaata agcatgctgg ctagacccgg tgatatcttc 35640 

cagataactg gacagaaaat cgggtaagca atttttaaga aaatcaacaa aagaaaaatc 35700 

ttccaggtgc acgtttaggg cctcgggaac aacgatggag taagtgcaag gggtgcgttc 35760 

cagcatggtt agttagctga tctgtaaaaa aacaaaaaat aaaacattaa accatgctag 35620 

cctggcgaac aggtgggtaa atcgttctct ccagcaccag gcaggccacg gggtctccgg 35880 

cgcgaccctc gtaaaaattg tcgctatgat tgaaaaccat cacagagaga cgttcccggt 35940 

Sgccggcgtg aatgattcga gaagaagcat acacccccgg aacattggag Cccgtgagtg 36000 

aaaaaaagcg gccgaggaag caatgaggca ctacaacgct cactctcaag tccagcaaag 36060 

cgatgccatg cggatgaagc acaaaatttt caggtg Cgta aa aaatg t aa ttactcccct 36120 

cctgcacagg cagcgaagct cccgatccct ccagatacac atacaaagcc tcagcgtcca 36X80 

tagcttaccg agcggcagca gcagcggcac acaacaggcg caagagtcag agaaaagact 36240 
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gagctctaac ctgtccgccc gctctctgct caatatatag ccccagatct acactgacgt 36300 

aaaggccaaa gtctaaaaat acccgccaaa taatcacaca cgcccagcac acgcccagaa 36360 

accggtgaca cactcagaaa aatacgcgca cttcctcaaa cggccaaact gccgtcattt 36420 

ccgggttccc acgctacgtc atcaaaacac gactttcaaa ttccgtcgac cgttaaaaac 36480 

atcacccgcc ccgcccctaa cggtcgccgc tcccgcagcc aatcaccttc ctccctcccc 36540 

aaattcaaac agctcatttg catattaacg cgcaccaaaa gtttgaggta tattattgat 36600 



gatg 



36604 



<210> 3 
<211> 36535 
<212> DNA 

<213> chimpanzee adenovirus serotype Pan7 
<400> 3 

catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60 

atttggggag ggaggaaggt gattggccga gagacgggcg accgttaggg gcggggcggg 120 

tgacgttttt aatacgtggc cgtgaggcgg agccggtttg caagttctcg tgggaaaagt 180 

gacgtcaaac gagg t g tggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240 

ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 3.00 

aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360 

ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420 

ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggc gtcagctgat cgccagggta 480 

tttaaacctg cgctctctag tpaagaggcc actcttgagt gccagcgagt agagttttct 540 

cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600 

gtaatgtttt cctggctact gggaacgaga ttctggaatt ggtggtggac gccatgatgg 660 

gtggcgaccc tcctgagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720 
atctggaggt ggatgtgccc gagaacgacc ccaacgagga ggcggtgaat gatttgttta . 780 

gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840 

cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900 

aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960 

aggaggcgat tcgagctgca tcgaaccagg gagtgaaagc tgcgggcgaa agctttagcc 1020 

tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080 
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« 99 a 9 at.a 9 ,a t9tgatg tgcgccetgt 9ctatatgag agcttacaac 

tsatt .. clt tagttgggaa 99cw ^ 

""" 9C " " 9 ""<" < ™< 9 a^a^a „, ct 

. 9 acca 9 « 9 „ 9t9 a 9 . gc c . cc999c99 t39agwct ^ 

«~«t. 999 a tg a.cc tt t99 .c tt9 t9taccC99a aac9ccccag 
ccaeaca C9t 9tgtttaett ^ 

•tc^t, . Mttaagtg c9Cg9ttcae ^^^^ t999wt5t ^ ^ 
9 « 99t9 ca 9 ,c« gtgtgg c „ 9eeoa9a gca99ietca t99a9atctg . gae 
9 « 9 ac tttc aoca 9 ac t a 9 ,ca 9 « 9 «, 9 , 9a a ctc . t m ^ 
t 99 . 9 *ct« 9 « tcggtgg gcctotagct awtagM at>gggccM ^ 
aa 99 .cc,. c „ gaggaeat cttgagagag cgtcocggta tctcaaettg 
99 =c*c. 9t «cac tte ,. coagagtatt ctgag , gcec 

. 9 aaccacc 9 c cgcggtagc ctttttCgcc t „ aeccttg ^ ^ 

999 , ctaccg t „ 99actgc ttagcag£ag ccttg£ggag aacatggagg 
t 9 a.t 9 o., e ctcc9gctae ttgccagcac agccggcaga ca=gc . 

t cc. 9C ca c o cca 99 aacae caac^cc. a 9 ca 9 c= 9c . 
~S=.. 9 a 99 . 99 a 9gagga tcgagaagag aacccgagag ccggtctg3a ^ 

*"* 9 " 9 « ac t , 99tctt 
c« 9t99 .c 9 99 . gaggg9g attaa9cg9g 

"=t 9 a« 9t c , gtctgatg ag „ gcag9c ggegt9gts3 eaegaggeec 

9 a Cagaegag gtctcggtga tgcatgagaa 
gceggag « t gaggatgatl gggaggtagc ^ 

«™ 9 .«.. C . t c a 9 a., K cc C 
9 c C ,c« tc a 999 ,a t9gg gccgaggtgg agatcagtac 

9 « 9 « 9t a t 9 , t9aatatg tacccggggg tg9tgggcat ^ acctteatsi 

™" SSt9 " t . tggocaac accaagctga 

-M— a t9ctccttc ttegggttca aEaaoaegeg catcgaggcc tg9g9c ^ 

t«c. 9C9 , 9 9g9 . egcagc ttttcagcoa actggatgg9 9gtcgegggc >g ^ g 
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9 caa ggt9tc a gtgaag aaa t g cc tgttcg ag a ggtg cca cc tgggg9tg atgagcgagg 
g c g aa g ccaa a gt caaacac tg c g cctcta ctgag ac ggg ct 3 c tttgtg ct g atcaa 9g 
3 caa t9 ccca a gt caa 9 cat aacatgatct gtgg9gcc tc gg a tgag c gc gg c t acca 9 a 
t3 ct g acc tg cg cc g9 t 9gg aacagccata tgctggccac cgtgcatgtg acctcgcacc 
=cc gcaaga c at gg ccc gag ttc gagca ca ac gtcatga c cc gatgca a t gtgc acct 99 
93 tccc 9 cc g a 9g ca tgttc atgccctacc a gtg caacat 9 caa tttgtg aaggtgctgc 
t 99 agccc g a tgccatgtcc a gagtg a 3 cc t gacggg3g t g ttt g acat g aatgtggagc 
t g t33 aaaat tc tgagatat gatgaatcca agaccaggtg ccgggcctgc gaatgcggag 
3 caagcac g c caggcttcag ccc gtgtgtg tggaggtgac ggaggacctg cgacccgatc 
atttggtgtt 3t cc tgca ac ggg ac 99 a gt tcggctccag c 9gggaagaa tctgactaga 
3 t 9 a 9t a 9tg ttt gggggag gtggagggct tgtacgaggg gcagaatgac taaaatctgfc 
gtttttct* gtgttgcagc a g cat g a gcg g a agcg cc t c ctt tgaggga ggggtattca 
3 ccc tta tc t 3a c ggggcgt ctcccctcct g9gcgggag t gcgt c agaat gtgatggga t 
ccac ggtgga c ggccgg ccc gt9 c ag ccc g cgaactcttc aaccctgacc tacgcgaccc 
tgagctcctc gt cc gtggac gcagctgccg cc g ca g c tgc tgcttccgcc g^cagcgccg 
t 9 c g c g9 aat 99 ccct gggc gccggctact acagctctct 3gtgg ccaac tcgacttcca 
ccaataatcc cgccagcctg aacgaggaga agctgctgct gctgatggcc cagctcgagg 
ccctgaccca gcgcctgggc gagctgaccc a gca g gtggc tcagctgcag gcggagacgc 
333 cc 9 c g9t tgccacggtg aaaacca aat aaaaaa t ga a C ca ataaat a aac 9g a 9 ac g 
gttflttg.tt ttaacacaga gtcttgaatc tttatttgat ttttcgcgcg cggtaggccc 
tggaccaccg gtctcgatca ttgagcaccc ggtgg.tttt ttccaggacc cggtagaggt 
333 ctt 9 gat gttgaggt ac at ggg cat ga gcccgtcccg 9gggCgg a 9g tagctccatt 
g cagggcctc gtgctcgggg gtgg tgttgt aaatcaccca gtcatagcag gggcgcaggg 
cgtggtgctg cacgatgtcc ttgaggagga gactgatggc cacgggcagc cccttggtgt 
a 33t3tC gac 9aa cc tg t tg ag ct gggagg g a tg catgc g gggggag a tg agatgcatct 
t 33 cct 99 at ctt g a 9 a ttg 3 c g a tgt tcc cgcccagatc cc 9 cc ggggg ttcatgttgt 
gcaggaccac ca g cac 9gtg tatccggtgc acttggggaa tttgtcatgc aacttggaag 
SSaaggcgtg aaagaatttg gagacgccct tgtgaccgcc c.ggttttcc at g Ca ctcat 
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ccat g a tgat 9gcgatgggc CC9tgggcgg cggcctgggc ^^^^ cg ^ ccgg 

acaca tC9 ta gttgtggtcc tgggtgagct cgtcataggc 

H 9 a 339tg cc c g act 93ggg acgaaggtgc cctcgatccc ^ 

a g atc t9 ca t ctccca ggC c ttgagctcgg agggggggat catgtccacc . tgcggggcga 
tg aaaaaaac ggtttccggg gcgggggaga 

9ct ggga c Ct gccg ca g cc g gtggggccgt agatgacccc gatgaccggc 
agtt g a ggga gagacagctg ccgtcctcgc ggaggagggg ggccacctcg ttcatcatct 
c g c 9Ca ca tg cat gttctcg cgcacgagtt ccgccaggag 
gg a gctCttg ca gcg a 9gcg aagtttttca gcggctfcgag 

t 9g a g a 3ggt ctgttgcaag agttccagac ggtcccagag ctcggtgafcg tgctct ^ 

CatCtC98tC Ca9C39aCCt CCtC9 "^ C ^" gggg cgactgcggg agfcagggcac 
ca ggcgatgg 9 c 9 tcca 3 c 3 aggccag9gt ccggfcccttc cagggtcgca 

ca 9 c gtggtc tcc gtC ac gg tgaaggggtg cgcgccgggc tgggcgcttg 

c t tca 99ct c atcc 9g c tgg tcgagaaccg ctcccggtcg gcgccctgcg 

9ta 9 caa ttg ag c atgagtt cgtagttgag 

c tt acc tttg gaagcgtgcc cgcagacggg acagaggagg gactfcgagg . 
99999c 3 a gg aagacggact cgggggcgta ggcgtccgcg ^^^^^ ^^^^ 

ctc 9 cac t cc ac 9 a 9 cca gg tgaggtcggg cc ggttgggg tcaaaaacga ggttt cctcc 
9t3 c tttttg atgcgtttct tacctcfcggt ^^^^ tcgtgtcccc 

aaa 9 a 99ctg tccgtgtccc cgtagaccga ^ 

9c ggt cctc g tcgtagagga accccgccca ctccgagacg aaggcccggg ^^^^^ 

cac g aa 9gag gccacgtggg ag9ggtagcg gtcgttgtcc accagcgggt ccaccttcfcc 
ca g99tatgc aagcacatgt ccccctcgfcc cacatcMgg ^^^^ ^^^^ 

9ta ggc ,ac g tgac c ggg9g tcccggccgg gggggtataa aagggggcgg gcccctgbfcc 
gtcctcact, tcttccggat cgctgtccag gagcgccagc ^^^^ ^^^^^ 
ctc gaaggct ggcataacct cggcactcag gttgtcagtt tctagaaacg aggaggafctt 
9atat t9 ac g gtg cc gttgg a g a cgc ct tt ca tgagcccc tcgtccatct ggtcagaaaa 
9ac g a tcttt ttgttgtcga gcttggtggc gaaggagccg tagagggcgc 
ctt ggcgatg gagcgcatgg tctggttctt ttcctfcgtcg gcgcgctcct fcggcggcgat 
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9ttg a 9ctgc ac gtactcg c gcg ccac g ca cttcc a ttcg gggaagacgg tggtgagcfcc ^ 
9t c 9gg cac 9 a ttctgaC cc g cca g cc 3 c g gttgtgcagg gtgatgaggt ccacgctggt 
gg cca C c tCg C c 9 c 9 ca ggg gctcgttggt ccagcagagg 

9aa 9gggggc agcgggtcca gpatgagctc gtcggggggg tcggcgtcca ^^^^ 
9cc g99 c aga agct c ggggt cgaagtagct gafcgcaggtg tccagatcgt ccagcgccgc 
t t9 cca gtcg c 9 cac 9gc ca 9 c gcg c 9 c t c gtaggggctg aggggcgtgc 
999 g t g c gtg a 9cg c ggagg C9 taca tg cc gcagatgtcg tagacgtaga ggggcfccctc 
9a 99 ac g cc g a tgtaggtgg ggtagcagcg ccccccgcgg atgctggcgc 
Stacagctc. tgc9 a gggcg cgaggagccc cgtgccgagg 

99c 9 c 9gtag acgatctggc ggaagafcggc gtgggagtfcg gaggagatgg ^^^^ 
9aa 9 a tgttg aagCgggcgt ggggcaggcc gaccgagtcc 
9 t =ct gcagc ttggcgacga gcfccggcggt tccagggcgc 
99 t ctc ttgg atgatgtcgt actfcgagctg gccctfcctgc ttccacagcfc 

aa ggaactct tcgcggt cc t tccagtactc ttcg a ggggg aacccgtcct gatcggcacg 
.9taa ga9 ccc acc atgtaga actggttgac ggcctfcgtag 

9999a gggCg taagcttgtg cggccttgcg cagggaggfcg ^^^^ 
9c g cac catg accttgagga actggtgctt gaagtcgagg tcgtcgcagc ^^^^ 

cca gagctgg aa 9t cc gt9c 9 c tt c ttgt a 99 c g99gttg ggcaaagcga aagtaacatc 
9t tg aa gagg a tcttg ccc g c 3 c g9ggcat gaagttgcga gtgatgcgga aaggctgggg 
cacc tcgg cc cggttgttga tgacctgggc ggcgaggacg atctcgtcga agccgttgat 
9t tgt9 ccc g ac 9atgtaga gttccacgaa tcgcgggcgg cccctaacgt ggggcagctt 
c ttg a 9ctcg tcgtaggtga gctcggcggg gfccgctgagc ccgtgctgcfc 7seo 
9 t c 9gcg ac g tgggggttgg cgctgaggaa ggaagtccag ^^^^ ccagggcggt 
c t9 caa 9cgg tcccggtact gacggaactg ctggcccacg gccacttttt 76BQ 
9ca gtag aa g gtgcgggggt cgccgtgcca gcggtcccac ^ 

9t999c g a g c tc gacgag c g gcgggtC ccc ggagagtttc atgaccagca tgaaggggac ^ 
9a gCtgcttg cc gaaggacc ccatccaggt gtaggttfccc acatcgtagg fcgaggaagag 786Q 
cc t ttc ggtg cgaggatgcg agccgafcggg gaagaacfcgg atctccfcgcc accagt ^ ^ 
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g 3 aa t3gct3 ttgatgtgat ggaagtagaa atgccgacgg ^^^^ 

gtgtttatac aa 3 c gt cc gc a gtgcC c g ca ac gctg cac g ggatg cac gt gctgcacgag 
c tg tacc tg3 gttcctttga cgaggaattt ^^^^ ^^^^ 

c tg3 t 3 c tgt ac ta c 9 tcc t 93C catc gg c 9 t gg ccat cg tc tg cctc g a t ggtggtcat 
g c tg ac 3 a 3 c cc gCgCg3g a 33 ca ggt cca gacttcggct cggacgggfcc ggagagcgag 
gac g a 333C3 c g c aggcc33 a 3 c t3CC ca 3 ggtcctgaga cgctgcggag fccaggtcagt 

9 3g ca gcgg c ggcg c g c 33t t3 ac tt9C a 3 3 a 3Cttttcc a 33gcg c g c g ggag3tC ca g 
a t33 tac tt3 a t ctccac 3g cgccgCtggt ggcgacgtcc 

cccc t33ggc g cc a ccacc g tgccccgttt c tt ct tgggc gctgc ttcca t g cc ggt ca g 
aa 3 c ggcggc gag3 ac g c gc 3 cc gggC33 c a 3333 c 3gct c 9gg accc gg a ggcaggggc 
9 3 ca gg9gC a c gt c ggcg cc g c g c g c 99gc a ggttctggt ac tgcg ccc g 9 a 9 a aga c tg 
gc 3t3a3C9a cg ac g c g ac g gttg ac 3tC c tggat c tg ac g cctct gggt gaaggccacg 
ggaccc 9tg a gtttga acc t 9 aaa gag a gt tcg aca g aat caa tCtC93t a t c 3 tt 3ac9 
gc 39 cc tg cc 9 ca 3g a tct c ttg cac gtcg ccc 9 a gCCgt cctggtaggc gatctcggtc 
at 3 aac tgct cgatctcctc c Ccctgaagg t c t cc gcggc . cggcgcgctc gacggtggcc 
3 c 3 a ggtcgt tggag a tgcg g ccca tg a 9 c t gcga9aagg cgttcatgcc ggcctcgttc 
ca 3 ac gcgg c tgt a g accac ggct cc gCcg gggtcgcgcg cgcgcatgac cacctgggcg 
a 33t t 3a3ct c 3a c 3t33C3 c 9tg a agaC c g c gtagttgc a 9 a 33 c 3 c t3 g ta g a 3gtag 
tt 3 a gcgtgg tggcg a tgtg ctcggtgacg aagaagfcaca tgafcccagcg gcggagcggc 
atctcgctga c gtcg ccca g ggc ttccaa g c 3 c t cca tgg cc t c 3 ta 3 aa 3t c Ca c 33 c 3 
aa gtt3aaa a act 33gagtt gcgcg cc g a g ac ggtcaact cctcctccag aa 3 ac 39 at g 
a 9 c t ca gcg a t ggtggcgcg cacctcgcgc tcgaaggccc cggggggcfcc 

atctcttcct cctccactaa c atctcttct acttcctcct c aggaggcgg cggcggggga 
ggg 3 ccc t3 c 3tC3 cc 9g c g g c g cac gggc a g ac ggtcg a t g aa gcgct c g at gg t ct cc 
CC9C9CC99C ctc ggtgacg gcgcgcccgt cctcgcgggg ccgcagcgtg 

aagacgccgc c gcg catctc c aggtggccg C c 9ggggggt ct cc gttggg ca ggga3agg 
gcgc tgacg a t9 catctta t caa ttgg ccc gtaggg a Ct c c gcg caa gga cct gag c gt c 
tc 3 a 3atC ca c gggat cc g a aaacc g c tg a acgaaggctC c 9 a g cca gtc g ca g tc g caa 
9 g t a ggctg a g ccc ggtttc ttgttcttcg gggafcttcgg gaggcgggcg ggcgatgctg 
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***** 99C99t « C9 agacggcgga tggcggcgag 9ago>cc>M 

tcc Ctg99 cc e M o ttg c tg 9 . tgcgoaga egcccc(ig9(: 3t9gcoct3i 

ccctMcg. ggtccttgta gca9Ccctgc at9a3ce9cc cc>cgggcic ctccecctcg 

oc^csc.c c^c, emcees .accent g9g9 ct 9g .c 9 a 9 c 9 cc ag9 

o 9 « gctc99c 9 , 9W99cc t9ct9tKct 999t9agg9t ^ 

tWWW C9,C9aa3C9 — ««» g ca gttgg « 10020 

.9«9.c 99C « g9tgg<:cg 9gtcgcacga 9ctcgC99ta cct9ag9c9c ioojo 

.WOT* gacgcagtcg tt9 „ 9gC9c 9cacga99ca ctggtatec9 ioim 

« 3 a»a. 9t « ggc99t . 9 agc99cc . tc 9ctcggt9gc imoo 

gg c 9 c g , 9gt e«c 9ag c t ga99c99t9g ca9cc9taga tgcacctgga cacccag9tg ioj6o 

*m**m* a,c t c 9 o 9g . C9e9gttcca 9at9ttgcgc lM20 

a**^. a g t, 9ttcat ggt99cc9C9 9tet99coog tgaggcgcgo gcag[cgt9g iojao 

" 9 ""* 9 * C ' CaC " 9Ca a "^"« «. tg9 c« 99 a g „„„ 
g «.. g c 3 aa £gcgcgtgta coccggttcg aatctcg>at 

~ 5 ea 9 c t .a C9t9gtaoeg « te «, ctw » t ,; JtctSttnt g , aacctcc „ w 

aW-OW WW*. mtgBaxt „ tgaaaaac eagcaagcgc io62o 

9 °"* 3C9 ' C C3CCC9C9 " »■«■*«■ ^«*> agaaag , ato gcca9ggtt9 10MO 

xttm. ^tc^ cctc^ cggccggatt cogcg9ctaa io74o 

•mow. 9 « g =ccc gt c gettc<:aag ac „ cccago cagccgactt ct _ gttac ioaoo 

^ """'"•= — ~9- t =~ gCactg c^.,.^ 

_ C cca ret c.a cc^cta, c 9 oc 9 „ gc . gcagcaacag ccggcgctcc ^ 

t.ecccc** cca 9 c. g c. g cc. gccaeta ^ 

«ca 9Mtga cctggccttg 9aaga9ggcg aggg9ccggc 9c99CC99gg gcgtcgCcgo iimo 

" C3C3C9,:9 C * 3 " 9 ""« ^ g t g ==caa g c iu.. 

. g .ac« 9tt c. gag . eagg agoggogagg 9atgc9cgcc tccegctccc iiim 

^c 9999cg ggagcC9cg9 W! „ accg . a , gcg g9t9ctgagg gacgaggatt iijzo 

C9a3 " 9 " 9 999 ' t °"» ~*~*=*= 0CC8.99CC ^cc.ce 

<**<*•»=•* .^t,..^ W<a cttcc „ aaa ccctccaac ,- 11MQ 
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accacgtgcg cacgctgatc gcgcgcgagg aggtgaccct gggcctgatg cacctgtggg H400 

acctgctgga ggccatcgtg cagaacccca cgagcaagcc gctgacggcg cagctgtttc 11460 

tggtggtgca gcacagtcgg gacaacgaga cgttcaggga ggcgctgctg aatatcaccg 11520 

agcccgaggg ccgctggctc ctggacctgg tgaacattct gcagagcatc gtggtgcagg l 158 0 

agcgcgggct gccgctgtcc gagaagctgg cggctatcaa cttctcggtg ctgagcctgg 11640 

gcaagtacta cgctaggaag atctacaaga ccccgtacgt gcccatagac aaggaggtga 11700 

agatcgacgg gttttacatg cgcatgaccc tgaaagtgct gaccctgagc gacgatctgg 1X760 

gggtgtaccg caacgacagg atgcaccgcg cggtgagcgc cagccgccgg cgcgagctga 11820 

gcgaccagga gctgatgcac agcctgcagc gggccctgac cggggccggg accgagggg g 11880 

agagctactt tgacatgggc gcggacctgc gctggcagcc cagccgccgg gccttggaag 11940 

ctgccggcgg ttccccctac gtggaggagg tggacgatga ggaggaggag ggcgagtacc 12000 

tggaagactg atggcgcgac cgtatttttg ctagatgcag caacagccac cgcctcctga 12060 

tcccgcgatg cgggcggcgc tgcagagcca gccgtccggc attaactcct cggacgattg 12120 

gacccaggcc atgcaacgca tcatggcgct gacgacccgc aatcccgaag cctttagaca 12180 

gcagcctcag gccaaccggc- tctcggccat cctggaggcc gtggtgccct cgcgctcgaa 12240 

ccccacgcac gagaaggtgc tggccatcgt gaacgcgctg gtggagaaca aggccatccg 12300 

cggcgacgag gccgggctgg tgtacaacgc gctgctggag cgcgtggccc gctacaacag 12360 

caccaacgtg cagacgaacc tggaccgcat ggtgaccgac gtgcgcgagg cggtg tC gca 12420 

9cgcgagcgg ttccaccgcg agtcgaacct gggctccatg gtggcgccga acgccttcct 12480 

gagcacgcag cccgccaacg tgccccgggg ccaggaggac tacaccaact tcatcagcgc 12540 

gctgcggctg atggtggccg aggtgcccca gagcgaggtg taccagtcgg ggccggacta 12600 

cttcttccag accagtcgcc aggg CttgC a gaccgtgaac ctgagccagg ctttcaagaa 12660 

cttgcaggga ctgtggggcg tgcaggcccc gg tC g 9gga c cgcgcgacgg tgtcgagcct 12720 

gctgacgccg aactcgcgcc tgctgctgct gctggtggcg cccttcacgg acagcggcag 12780 

cgtgagccgc gactcgtacc tgggctacct gcttaacctg taccgcgagg ccatcgggca 12840 

ggcgcacgtg gacgagcaga cctaccagga gatcacccac gtgagccgcg cgctgggcca 12900 

ggaggacccg ggcaacctgg aggccaccct gaacttcctg ctgaccaacc ggtcgcagaa 12960 

gatcccgccc cagtacgcgc tgagcaccga ggaggagcgc atcctgcgct acgtgcagca 13020 

9agcgtgggg ctgttcctga tgcaggaggg ggccacgccc agcgccgcgc tcgacatgac 13060 
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cgcgcgcaac atggagccca gcatgtacgc tcgcaaccgc ccgttcatca ataagctgat 13140 

ggactacttg catcgggcgg ccgccatgaa ctcggactac tttaccaacg ccatcttgaa 13200 

cccgcactgg ctcccgccgc ccgggttcta cacgggcgag tacgacatgc ccgaccccaa 13260 

cgacgggttc ctgtgggacg acgtggacag cagcgtgttc tcgccgcgcc ccgccaccac 13320 

cgtgtggaag aaagagggcg gggaccggcg gccgtcctcg gcgctgtccg gtcgcgcggg 13380 

tgctgccgcg gcggtgcctg aggccgccag ccccttcccg agcctgccct tttcgctgaa 13440 

cagcgtgcgc agcagcgagc tgggtcggct gacgcggccg cgcctgctgg gcgaggagga 13500 

gtacctgaac gactccttgt tgaggcccga gcgcgagaag aacttcccca ataacgggat 13560 

agagagcctg gtggacaaga tgagccgctg gaagacgtac gcgcacgagc acagggacga 13620 

gccccgagct agcagcagcg caggcacccg tagacgccag cgacacgaca ggcagcgggg 13680 

tctggtgtgg gacgatgagg attccgccga cgacagcagc gtgttggact tgggtgggag 13740 

tggtggtggt aacccgttcg ctcacttgcg cccccgtatc gggcgcctga tgtaagaatc 13800 

tgaaaaaata aaaaacggta ctcaccaagg ccatggcgac cagcgtgcgt tcttctctgt 13860 

tgtttgtagt agtatgatga ggcgcgtgta cccggagggt cctcctccct cgtacgagag 13920 

cgtgatgcag caggcggtgg cggcggcgat gcagcccccg ctggaggcgc cttacgtgcc 13980 

cccgcggtac ctggcgccta cggaggggcg gaacagcatt cgttactcgg agctggcacc 14040 

cttgtacgat accacccggt tgtacctggt ggacaacaag tcggcggaca tcgcctcgct 14100 

gaactaccag aacgaccaca gcaacttcct gaccaccgtg gtgcagaaca acgatttcac 14160 

ccccacggag gccagcaccc agaccatcaa ctttgacgag cgctcgcggt ggggcggcca 14220 

gctgaaaacc atcatgcaca ccaacatgcc caacgtgaac gagttcatgt acagcaacaa 14280 

gttcaaggcg cgggtgatgg tctcgcgcaa gacccccaat ggggtcgcgg tggatgagaa 14340 

ttatgatggt agtcaggacg agctgactta cgagtgggtg gagtttgagc tgcccgaggg 14400 

caacttctcg gtgaccatga ccatcgatct gatgaacaac gccatcatcg acaactactt 14460 

ggcggtgggg cgtcagaacg gggtgctgga gagcgacatc ggcgtgaagt tcgacacgcg 14520 

caacttccgg ctgggctggg accccgtgac cgagctggtg atgccgggcg tgtacaccaa 14580 

cgaggccttc caccccgaca tcgtcctgct gcccggctgc ggcgtggact tcaccgagag 14640 

ccgcctcagc aacctgctgg gcatccgcaa gcggcagccc ttccaggagg gcttccagat 14700 

cctgtacgag gacctggagg ggggcaacat ccccgcgctc ttggatgtcg aagcctatga 14760 
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gaaaagcaag gaggaggccg ccgcagcggc gaccgcagcc gtggccaccg cctctaccga 14820 

ggtgcggggc gataattttg ctagcgccgc ggcagtggcc gaggcggctg aaaccgaaag 14880 

taagatagtc atccagccgg tggagaagga cagcaaggac aggagctaca acgtgctcgc 14940 

ggacaagaaa aacaccgcct accgcagctg gtacctggcc tacaactacg gcgaccccga 15000 

gaagggcgtg cgctcctgga cgctgctcac cacctcggac gtcacctgcg gcgtggagca 15060 

agtctactgg tcgctgcccg acatgatgca agacccggtc accttccgct ccacgcgtca 1S120 

agttagcaac tacccggtgg tgggcgccga gctcctgccc gtctactcca agagcttctt 1S180 

caacgagcag gccgtctact cgcagcagct gcgcgccttc acctcgctca cgcacgtctt 15240 

caaccgcttc cccgagaacc agatcctcgt ccgcccgccc gcgcccacca ttaccaccgt 15300 

cagtgaaaac gttcctgctc tcacagatca cgggaccctg ccgctgcgca gcagtatccg 15360 

gggagtccag cgcgtgaccg tcactgacgc cagacgccgc acctgcccct acgtctacaa 15420 

ggccctgggc gtagtcgcgc cgcgcgtcct ctcgagccgc accttctaaa aaatgtccat 15480 

tctcatctcg cccagtaata acaccggttg gggcctgcgc gcgcccagca agatgtacgg 15540 

aggcgctcgc caacgctcca cgcaacaccc cgtgcgcgtg cgcgggcact tccgcgctcc 15600 

ctggggcgcc ctcaagggcc gcgtgcgctc gcgca.ccacc gtcgacgacg tgatcgacca 15660 

ggtggtggcc gacgcgcgca actacacgcc cgccgccgcg cccgcctcca ccgtggacgc .15720 

cgtcatcgac agcgtggtgg ccgatgcgcg ccggtacgcc cgcgccaaga gccggcggcg 15780 

gcgcatcgcc cggcggcacc ggagcacccc cgccatgcgc gcggcgcgag ccttgctgcg 15840 

cagggccagg cgcacgggac gcagggccat gctcagggcg gccagacgcg cggcctccgg 15900 

cagcagcagc gccggcagga cccgcagacg cgcggccacg gcggcggcgg cggccatcgc 15960 

cagcatgtcc cgcccgcggc gcggcaacgt gtactgggtg cgcgacgccg ccaccggtgt l60 20 

gcgcgtgccc gtgcgcaccc gcccccctcg cacttgaaga tgctgacttc gcgatgttga 16080 

tgtgtcccag cggcgaggag gatgtccaag cgcaaataca aggaagagat gctccaggtc 16140 

atcgcgcctg agatctacgg ccccgcggtg. aaggaggaaa gaaagccccg caaactgaag 16200 

cgggtcaaaa aggacaaaaa ggaggaggaa gatgtggacg gactggtgga gtttgtgcgc 16260 

gagttcgccc cccggcggcg cgtgcagtgg cgcgggcgga aagtgaaacc ggtgctgcgg 16320 

cccggcacca cggtggtctt cacgcccggc gagcgttccg gctccgcctc caagcgctcc 16380 

tacgacgagg tgtacgggga cgaggacatc ctcgagcagg cggtcgagcg tctgggcgag 16440 

tttgcttacg gcaagcgcag ccgccccgcg cccttgaaag aggaggcggt gtccatcccg 16500 
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ctqqaccaca 




gccgagcctg 


aagccggtga 


ccctgcagca 


ggtgctgccg 


16560 


EQ C Q CQQ CQC 


^y^-yt-cyggg 


ct teaagege 


gagggeggeg 


aggatctgta 


cccgaccatg 


16620 


raoc toa too 


tgcccaagcg 


ccagaagctg 


gaggacgtgc 


tggagcacat 


gaaggtggac 


16680 


c c ca aaa tor 


^ycccgaggc 


caaggtgegg 


cccatcaagc 


aggtggcccc 


gggcctgggc 


16740 


atacaaarrn 


c-ggacaccaa 


gatccccacg 


gageccatgg 


aaaegcagae 


cgagcccgtg 


16800 




ccagcaccat 


ggaggtgcag 


acggatccct 


ggatgeegge 


gccggcttcc 


16860 


dCCdClCyCC 


gaagacgcaa 


gtacggcgcg 


gccagcctgc 


tgatgcccaa 


ctacgcgctg 


16920 




tcatccccac 


gccgggctac 


cgcggcacgc 


gcttctaccg 


cggctacacc 


16980 


a a o si ^* « « 


gcaagaccac 


cacccgccgc 


cgccgtcgtc 


gcacccgccg 


cagcagcacc 


17040 


yCydCCCCCy 


ccgccgccct 


ggtgcggaga 


gtgtaccgca 


gegggegega 


gcctctgacc 


17100 


v- uy i^wy <„yc.y 


cgcgctacca 


cccgagcatc 


gecatttaac 


tctgccgtcg 


cctcctactt 


17160 


yuoyaLatyy 


ccctcacatg 


ccgcctccgc 


gtccccatta 


cgggctaccg 


aggaagaaag 


17220 


*-v_y tyceyea 


gaaggctgac 


ggggaacggg 


ctgcgtcgcc 


atcaccaccg 


gcggcggcgc 


17280 




agcggttggg 


gggaggcttc 


ctgcccgcgc 


tgatccccat 


catcgccgcg 


17340 


yegacegggg 


cgatccccgg 


catagcttcc 


gtggcggtgc 


aggcctctca 


gcgccactga 


.17400 


ya\»a\.ayLUt 


ggaaaat ttg 


taataaaaaa 


atggactgac 


gctcctggtc 


ctgtgatgtg 


17460 


tQtttttaoA 


tggaagacat 


caatttttcg 


tccctggcac 


cgcgacacgg 


cacgcggccg 


17520 


t" 1 1* A t nnnr a 


ectggagega 


catcggcaac 


agecaactga 


aegggggege 


cttcaattgg 


17580 




ggageggget 


taagaatttc 


gggtccacgc 


tcaaaaccta 


tggcaacaag 


17640 


gcgtggaaca 


gcagcacagg 


geaggegctg 


agggaaaagc 


tgaaagagca 


gaacttccag 


17700 


cagaaggegg 


tegatggect 


ggcctcgggc 


ateaaegggg 


tggtggacct 


ggccaaccag 


17760 


geegtgeaga 


aacagatcaa 


cagccgcctg 


gacgcggtcc 


cgcccgcggg 


gtccgtggag 


17820 


acgccccagg 


tggaggagga 


gctgcctccc 


ctggacaagc 


gcggcgacaa 


gcgaccgcgt 


17880 


cccgacgcgg 


aggagacget 


gctgacgcac 


aeggacgage 


cgcccccgta 


egaggaggeg 


17940 


gtgaaactgg 


gtctgcccac 


cacgcggccc 


gtggcgcctc 


tggccaccgg 


ggtgctgaaa 


18000 


cccagcagca 


gcagccagcc 


cgcgaccctg 


gacttgcctc 


cgcctgcttc 


ccgcccctcc 


18060 


acagtggcta 


agcccctgcc 


gccggtggcc 


gtcgcgtcgc 


gcgccccccg 


aggccgcccc 


18120 


caggegaact 


ggcagagcac 


tctgaacagc 


atcgtgggtc 


tgggagtgca 


gagtgtgaag 


18180 
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cgccgccgct gctattaaaa gacactgtag cgcttaactt gcttgtctgt gtgtatatgt 18240 

atgtccgccg accagaagga ggaagaggcg cgtcgccgag ttgcaagatg gccaccccat 18300 

cgatgctgcc ccagtgggcg tacatgcaca tcgccggaca ggacgcttcg gagtacctga 18360 

gtccgggtct ggtgcagttc gcccgcgcca cagacaccta cttcagtctg gggaacaagt 18420 

ttaggaaccc cacggtggcg cccacgcacg atgtgaccac cgaccgcagc cagcggctga 18480 

cgctgcgctt cgtgcccgtg gaccgcgagg acaacaccta ctcgtacaaa gtgcgctaca 18540 

cgctggccgt gggcgacaac cgcgtgctgg acatggccag cacctacttt gacatccgcg 18600 

gcgtgctgga tcgggggccc agcttcaaac cctactccgg caccgcctac aacagcctgg 18660 

ctcccaaggg agcgcccaac acttgccagt ggacatataa agctggtgat actgatacag 18720 

aaaaaaccta tacatatgga aatgcacctg tgcaaggcat tagcattaca aaggatggta 18780 

ttcaacttgg aactgacagc gatggtcagg caatctatgc agacgaaact tatcaaccag 18840 

agcctcaagt gggtgatgct gaatggcatg acatcactgg tactgatgaa aaatatggag 18900 

gcagagctct taagcctgac accaaaatga agccttgcta tggttctttt gccaagccta 18960 

ccaataaaga aggaggccag gcaaatgtga aaaccgaaac aggcggtacc aaagaatatg 19020 

acattgacat ggcattcttc gataatcgaa gtgcagctgc cgccggccta gccccagaaa 19080 

ttgttttgta tactgagaat gtggatctgg aaactccaga tacccatatt gtatacaagg 19140 

caggtacaga tgacagtagc tcttctatca atttgggtca gcagtccatg cccaacagac 19200 

ccaactacat tggcttcaga gacaacttta tcggtctgat gtactacaac agcactggca 19260 

atatgggtgt actggctgga caggcctccc agctgaatgc tgtggtggac ttgcaggaca 19320 

gaaacaccga actgtcctac cagctcttgc ttgactctct gggtgacaga accaggtatt 19380 

tcagtacgcg gaatcaggcg gtggacagtt atgaccccga tgtgcgcatt attgaaaatc 19440 

acggtgtgga ggatgaactt cctaactatt gcttccccct ggatgctgtg ggtagaactg 19500 

atacttacca gggaattaag gccaatggtg ataatcaaac cacctggacc aaagatgata 19560 

ctgttaatga tgcCaatgaa ttgggcaagg gcaatccttt cgccatggag atcaacatcc 19620 

aggccaacct gtggcggaac ttcctctacg cgaacgtggc gctgtacctg cccgactcct 19680 

acaagtacac gccggccaac atcacgctgc ccaccaacac caacacctac gattacatga 19740 

acggccgcgt ggtggcgccc tcgctggtgg acgcctacat caacatcggg gcgcgctggt 19800 

cgctggaccc catggacaac gtcaacccct tcaaccacca ccgcaacgcg ggcctgcgat 19860 

accgctccat gctcctgggc aacgggcgct acgtgccctt ccacatccag gtgccccaaa 19920 
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agtttttcgc 


catcaagagc ctcctgctcc 


tgcccgggtc 


ctacacctac 


gagtggaact 


19980 


tccgcaagga 


cgtcaacatg atcctgcaga 


gctccctcgg 


caacgacctg 


cgcacggacg 


20040 


gggcctccat 


cgccttcacc 


agcatcaacc 


tctacgccac 


cttcttcccc 


atggcgcaca 


20100 


acaccgcctc 


cacgctcgag gccatgctgc 


gcaacgacac 


caacgaccag 


tccttcaacg 


20160 


actacctctc 


ggcggccaac 


atgctctacc 


ccatcccggc 


caacgccacc 


aacgtgccca 


20220 


tctccatccc 


ctcgcgcaac 


tgggccgcct 


tccgcggctg 


gtccttcacg 


cgcctcaaga 


20280 


cccocaaaar 


gccctcgctc ggctccgggt 


tcgaccccta 


cttcgtctac 


tcgggctcca 


20340 


tcccctacct 


cgacggcacc 


ttctacctca 


accacacctt 


caagaaggtc 


tccatcacct 


20400 




cgtcagctgg 


cccggcaacg 


accgcctcct 


gacgcccaac 


gagttcgaaa 


20460 


tcaaacacac 


cgtcgacgga gaggggtaca 


acgtggccca 


gtgcaacatg 


accaaggact 


20520 


qqttcctqqt 


ccagatgctg gcccactaca 


acatcggcta 


ccagggcttc 


tacgtgcccg 


20580 


agggctacaa 


ggaccgcatg 


tactccttct 


tccgcaactt 


ccagcccatg 


agccgccagg 


20640 


tcgtqqacqa 


ggtcaactac 


aaggactacc 


aggccgtcac 


cctggcctac 


cagcacaaca 


20700 


actcqqqctt 


cgtcggctac ctcgcgccca 


ccatgcgcca 


gggccagccc 


taccccgcca 


20760 


actacccc ta 


cccgctcatc ggcaagagcg 


ccgtcgccag 


egtcacccag 


aaaaagttcc 


20820 


tctgcgaccg 


ggtcatgtgg 


cgcatcccct 


tctccagcaa 


cttcatgtcc 


atgggcgcgc 


20880 




cggccagaac 


atgctctacg 


ccaactccgc 


ccacgcgcta 


gacatgaatt 


20940 


tcqaaatcaa 


ccccatggat 


gagtccaccc 


ttctctatgt 


tgtcttcgaa 


gtcttcgacg 


21000 


v w y u Luy a y i_ 


gcaccagccc 


caccgcggcg 


tcatcgaggc 


cgtctacctg 


cgcacgccct 


21060 




caacgccacc 


acctaagcct 


cttgcttctt 


gcaagatgac 


ggcctgcgcg 


21120 




agcaggagct 


cagggccatc 


ctccgcgacc 


tgggctgcgg 


gccctgcttc 


21180 




tcgacaagcg 


cttcccggga 


ttcatggccc 


cgcacaagct 


ggcctgcgcc 


21240 


atcgtcaaca 


cggccggccg 


cgagaccggg 


ggcgagcact 


ggctggcctt 


cgcctggaac 


21300 


3 3^* ^» V* v» x» 


acacctgcta 


cctcttcgac 


cccttcgggt 


tctcggacga 


gcgcctcaag 


21360 


cagatctacc 


agttcgagta 


cgagggcctg 


ctgcgtcgca 


gcgccctggc 


caccgaggac 


21420 


cgctgcgtca 


ccctggaaaa 


gtccacccag 


accgtgcagg 


gtccgcgctc 


ggccgcctgc 


21480 


gggctcttct 


gctgcatgtt 


cctgcacgcc 


ttcgtgcact 


ggcccgaccg 


ccccatggac 


21540 


aagaacccca 


ccatgaactt 


gctgacgggg 


gtgcccaacg 


gcatgctcca gtcgccccag 


21600 
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coerce, — 9g . S 9c9 «« ace 9cttc=tc „ cgcocactce ji66o 
g cc t .« cec 9 «„ caco9 ^ gwc> ce9ccctcgt ^ 

ca.*c at9t „ CCCMt9t gt9tat9t9a t9c tMt ^ 

9 «c« 9 cc a ecttctct9 , 99ctctgaot teattta9aa ^ ^ 

to t c 99 c, t9 9CCC9C999C . 939acac3t egcggaactg gtactt5Mc ^ 

•ttWt c. 9c a 9 «« 99cac999g , cgagtcgctc ^ 

9 o 9C9 , 9tt9 ca999 . 9ceo , g „ ggccgg gcgo9gasac ^ ^ 

"^^ C9C9C9a3 " «—«*=. *»« 99 .ac .cc. cc , 999 22080 

— *«t c ac9ctC9e , a9caccgtcg cgMggtgat tccsg ^ ^ 

C99C9 "" C °" C " 9 * a3 - 9 c«c. t9 « 999cac9c 22200 

. g cc 999 e tc 9t99tt9ca , tC9cag£gca ^ caecatot93 

. 3 =tc at9 cc CM9ta c atg 90cetcat9 . aage „ cc , g etggcggaag gcccgct3=g ^ 

c=«scc 9 cc c Cc99t9aa9 aa9accecgc aggacttgct agagaaccg9 ^ 

^cc^tc 9t9cacgca9 ca9C9c9cgt cgctgetggc cagctgoacc ^ 

cc^t ct999t9atc tt99cccggt cggggttcto cttcagc9cg ^ 

ccce^c c a c«ec. tc tc9 , tc9tgt gct „ ttctg ^ 

99 c~ 9 „ 9 c« 9c=ctc9 ^ cc99t9c acccgtgcag „ acagcgcg cagcc99tgc 22ho 

^. c CC9t999C9 at « 9gg . gt gcgagtgcac gaagccotgc ^ 

93C "" 9tC " 9 " 9 " 99 e "^"= 22 „0 
e 9 «c.c at . ca99t99cag atacggcggt acacctcgcc ctgctcg9 ^ ^ 

a 99 c 99 .o CC cag9t=gccc tcc>cgcggt accggtccat cagcigcgt= at=accccca 2296o 

t 9 =c=«c t c c=, 99 « 9 „ a<:gaccggc , ggctcagggg gttcttcacc 22Mo 

1 * fM ' C9CC9 " 9t ° ^'"^ SS^Oa.c „«e 9 c Ctg c 22s80 
c 9 l*c«c t c n ^ aaaa . ao9999g9M a9ctc „ cct 230Jo 

e 93 o« 9 c« tccgtcctcg „ gtcctggc tgatgccctg eaa>ggcra ^ 
<~»*. «« tE999c gcggcggaga cgtgcegggc gagcgwgc 22im 

'"^ CaC9aCt '" — W c 99 c ggtagg 22220 

ct 9 c«o tt ccg9ggcaga M o 39a99c9 ac999ctctc gcggttcggc 232<o 

~~»* ^ 9999t9c9ec „ tggoggcg ccgctccgac tgacctrete 222io 
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tgcgctc 


tcctagggag 


caagcatgga 


gactcagcca. 


tcgtcgccaa 


23400 


catcacratr 


tgcccccgcc 


gccgccgacg 


agaaccagca 


gcagcagaat 


gaaagcttaa 


23460 




gcccagcccc 


acctccgacg 


ccgcagcccc 


agacatgcaa 


gagatggagg 


23520 


A a 1" rra h /■via 


gattgacctg 


ggctacgtga 


cgcccgcgga 


gcacgaggag 


gagctggcag 


23580 


cgcgctcttc 


agccccggaa 


gagaaccacc 


aagagcagcc 


agagcaggaa 


gcagagagcg 


23640 




ggctgggctc 


gagcatggcg 


actacctgag 


cggggcagag 


gacgtgctca 


23700 


LCdayCdLCL 


ggcccgccaa 


tgcatcatcg 


tcaaggacgc 


gctgctcgac 


cgcgccgagg 


23760 




cgtggcggag 


ctcagccgcg 


cctacgagcg 


caacctcttc 


tcgccgcgcg 


23820 




gcgccagccc 


aacggcacct 


gcgagcccaa 


cccgcgcctc 


aacttctacc 


238B0 


cggtcttcgc 


ggtgcccgag 


gccctggcca 


cctaccacct 


ctttttcaag 


aaccaaagga 


23940 


tccccgtctc 


ctgccgcgcc 


aaccgcaccc 


gcgccgacgc 


cctgctcaac 


ctgggccccg 


24000 


gcgcccgcct 


acctgatatc 


gcctccttgg 


aagaggttcc 


caagatcttc 


gagggtctgg 


24060 


gcagcgacga 


gactcgggcc 


gcgaacgctc 


tgcaaggaag 


cggagaggag 


catgagcacc 


24120 


acagcgccct 


ggtggagttg 


gaaggcgaca 


acgcgcgcct 


ggcggtcctc 


aagcgcacgg 


24180 


tcgagccgac 


ccacttcgcc 


tacccggcgc 


tcaacctgcc 


ccccaaggtc 


atgagcgccg 


24240 


ted cggacca 


ggtgctcatc 


aagcgcgcct 


cgcccctctc 


ggaggaggag 


atgcaggacc 


24300 


ccgagagctc 


ggacgagggc 


aagcccgtgg 


tcagcgacga 


gcagctggcg 


cgctggctgg 


24360 


gagcgagtag 


caccccccag 


agcctggaag 


agcggcgcaa 


gctcatgatg 


gccgtggtcc 


24420 


tggtgaccgt 


ggagctggag 


tgtctgcgcc 


gcttcttcgc 


cgacgcggag 


accctgcgca 


24480 


aggtcgagga 


gaacctgcac 


tacctcttca 


gacacgggtt 


cgtgcgccag 


gcctgcaaga 


24540 


tctccaacgt 


ggagctgacc 


aacctggtct 


cctacatggg 


catcctgcac 


gagaaccgcc 


24600 


tggggcagaa 


cgtgctgcac 


accaccctgc 


gcggggaggc 


ccgccgcgac 


tacatccgcg 


24660 


actgcgtcta 


cctgtacctc 


tgccacacct 


ggcagacggg 


catgggcgtg 


tggcagcagt 


24720 


gcctggagga 


gcagaacctg 


aaagagctct 


gcaagctcct 


gcagaagaac 


ctcaaggccc 


24780 


tgtggaccgg 


gttcgacgag 


cgcaccaccg 


cc gcggacct 


ggccgacctc 


atcttccccg 


24840 


agcgcctgcg 


gctgacgctg 


cgcaacgggc 


tgcccgactt 


tatgagccaa 


agcatgttgc 


24900 


aaaactttcg 


ctctttcatc 


ctcgaacgct 


ccgggatcct 


gcccgccacc 


tgctccgcgc 


24960 


tgccctcgga 


cttcgtgccg 


ctgaccttcc 


gcgagtgccc 


cccgccgctc 


tggagccact 


25020 
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gctacctgct gcgcctggcc aactacctgg cctaccactc ggacgtgatc gaggacgtca 25080 

Scggcgaggg cctgctcgag tgccactgcc gctgcaacct ctgcacgccg caccgctccc 25140 

tggcctgcaa cccccagctg ctgagcgaga cccagatcat cggcaccttc gagttgcaag 25200 

gccccggcga gggcaagggg ggtctgaaac tcaccccggg gctgtggacc tcggcctact 25260 

tgcgcaagtt cgtgcccgag gactaocatc ccttcgagat caggttctac gaggaccaat 25320 

cccagccgcc caaggccgag ctgtcggcct gcgtcatcac ccagggggcc atcctggccc 25380 

aactgcaagc catccagaaa tcccgccaag aatttctgct gaaaaagggc cacggggtct 25440 

acttggaccc ccagaccgga gaggagctca accccagctt cccccaggat gccccgagga 25500 

agcagcaaga agctgaaagt ggagctgccg ccgccgccgg aggatttgga ggaagactgg 25560 

gagagcagtc aggcagagga ggaggagatg gaagactggg acagcactca ggcagaggag 25620 

gacagcctgc aagacagtct ggaggaggaa gacgaggtgg aggaggcaga ggaagaagca 25680 

gccgccgcca gaccgtcgtc ctcggcggag gaggagaaag caagcagcac ggataccatc 25740 

tccgctccgg gtcggggtcg cggcggccgg gcccacagta gatgggacga gaccgggcgc 25800 

ttcccgaacc ccaccaccca gaccggtaag aaggagcggc agggatacaa gtcctggcgg 25B60 

gggcacaaaa acgccatcgt ctcctgcttg caagcctgcg ggggcaacat ctccttcacc 25920 

cggcgctacc tgctcttcca ccgcggggtg aacUccccc gcaacatctt gcattactac 25980 

cgtcacctcc acagccccta ctactgtttc caagaagagg cagaaaccca gcagcagcag 26040 

cagcagcaga aaaccagcgg cagcagctag aaaatccaca gcggcggcag gtggactgag 26 100 

gatcgcggcg aacgagccgg cgcagacccg ggagctgagg aaccggatct ttcccaccct 26160 

ctatgccatc ttccagcaga gtcgggggca agagcaggaa ctgaaagtca agaaccgttc 26220 

tctgcgctcg ctcacccgca gttgtctgta tcacaagagc gaagaccaac ttcagcgcac 26280 

tctcgaggac gccgaggctc tcttcaacaa gtactgcgcg ctcactctta aagagtagcc 26340 

cgcgcccgcc cacacacgga aaaaggcggg aattacgtca ccacctgcgc ccctcgcccg 26400 

accatcatca tgagcaaaga gattcecacg ccttacatgt ggagctacca gccccagatg 26460 

ggcctggccg ccggcgccgc ccaggactac tccacccgca tgaactggct cagtgccggg 26520 

cccgcgatga tctcacgggt gaatgacatc cgcgcccacc gaaaccagat actcctagaa 26580 

cagtcagcga tcaccgccac gccccgccat caccttaatc cgcgtaattg gcccgccgcc 26640 

ctggtgtacc aggaaattcc ccagcccacg accgtactac ttccgcgaga ogcccaggcc 26700 

gaagtccagc tgactaactc aggtgtccag ctggccggcg gcgccgccct gtgtcgtcac 26760 
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cgccccgctc agggtataaa gcggctggtg atccgaggca gaggcacaca gctcaacgac 26820 

gaggtggtga gctcttcgct gggtctgcga cctgacggag tcttccaact cgccggatcg 26880 

gggagatctt ccttcacgcc tcgtcaggcc gtcctgactt tggagagttc gtcctcgcag 26940 

ccccgctcgg gtggcatcgg cactctccag ttcgtggagg agttcactcc ctcggtctac 27000 

ttcaacccct tctccggctc ccccggccac tacccggacg agttcatccc gaacttcgac 27060 

gccatcagcg agtcggtgga cggctacgat tgaatgtccc atggtggcgc ggctgaccta 27120 

gctcggcttc gacacctgga ccactgccgc cgcttccgct gcttcgctcg ggatctcgcc 27180 

gagtttgcct actttgagct gcccgaggag caccctcagg gcccggccca cggagtgcgg 27240 

atcgtcgtcg aagggggtct cgabtcccac ctgcttcgga tcttcagcca gcgtccgatc 27300 

ctggccgagc gcgagcaagg acagaccctt ctgaccctgt actgcatctg caaccacccc 27360 

ggcctgcatg aaagtctttg ttgtctgctg tgtactgagt ataataaaag ctgagatcag 27420 

cgactactcc ggacttccgt gtgttcctgc tatcaaccag tccctgttct tcaccgggaa 27480 

cgagaccgag ctccagctcc agtgtaagcc ccacaagaag tacctcacct ggctgttcca 27540 

gggctctccg atcgccgttg tcaaccactg cgacaacgac ggagtcctgc tgagcggccc 27600 

tgccaacctt actttttcca cccgcagaag caagctccag ctcttccaac ccttcctccc 27660 

cgggacctat cagtgcgtct cgggaccctg ccatcacacc ttccacctga tcccgaatac 27720 

cacagcgtcg ctccccgcta ctaacaacca aactacccac caacgccacc gtcgcgacct 27780 

ttcctctggg tctaatacca ctaccggagg tgagctccga ggtcgaccaa cctctgggat 27840 

ttactacggc ccctgggagg tggtagggtt aatagcgcta ggcctagttg cgggtgggct 27900 

tttggctctc tgctacctat acctcccttg ctgttcgtac ttagtggtgc tgtgttgctg 27960 

gtttaagaaa tggggaagat caccctagtg agctgcggtg tgctggtggc ggtggtgctt 28020 

tcgattgtgg gactgggcgg cgcggctgta gtgaaggaga aggccgatcc ctgcttgcat 28080 

ttcaatcccg acaaatgcca gctgagtttt cagcccgatg gcaatcggtg cgcggtgctg 2814 0 

atcaagtgcg gatgggaatg cgagaacgtg agaatcgagt acaataacaa gactcggaac 28200 

aatactctcg cgtccgtgtg gcagcccggg gaccccgagt ggtacaccgt ctctgtcccc 28260 

ggtgctgacg gctccccgcg caccgtgaat aatactttca tttttgcgca. catgtgcgac 28320 

acggtcatgt ggatgagcaa gcagtacgat atgtggcccc ccacgaagga gaacatcgtg 28380 

gtcttctcca tcgcttacag cgtgtgcacg gcgctaatca ccgctatcgt gtgcctgagc 28440 
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attcacatsc tcatcgctat tcgccccaga aataatgccg aaaaagaaaa acagccataa 28 500 

cacg tttttt cacacacctt tttcagacca t g 9 cc tct g t taaatttttg c ttttat ttg 28560 

ccagtctcat tgccgtcatt catggaatga gtaatgagaa aattactatt tacactggca 28620 

ctaatcacac a tt gaaagg t ccagaaaaag ccacagaagt ttcatggtat tgttatttta 28680 

atgaatcaga tgtatc.act gaactctgtg gaaacaataa caaaaaaaat gagagcatta 287,0 

ctctcatcaa gtttcaatgt ggatctgact taaccctaat taacatcact agagactatg 28800 

taggtatgta ttatggaact acagcaggca tttcggacat ggaattttat caagtttctg 28860 

tgtctgaacc caccacgcct agaatgacca caaccacaaa aactacacct gttaccacta 28920 

tacagctcac taccaatggc tttcttgcca tgcttcaagt ggctgaaaat agcaccagca 28980 

ttcaacccac cccacccagt gaggaaattc ccagatccat ga tt g gC a tt a tt g ttgct g 29040 

tag t gg t g Cg catgttgatc atcgccttgt gcatggtgta ctatgccttc tgctacagaa 29100 

agcacagact gaacgacaag ctggaacact tactaagtgt tgaattttaa ttttttagaa 29160 

ccatgaagat cctaggcctt ttagtttttt ctatcattac ctctgctcta tgcaattctg 29220 

acaatgagga cgttactg.tc gttgtcggat caaattatac actaaaaggt ccagcaaaag 29280 

gtatgctttc- gtggtattgt tggttcggaa ctgacgagca acagacagaa ctttgcaatg 29340. 

ctcaaaaagg caaaacctca aattctaaaa tctctaatta tcaatgcaat ggcactgact 29400 

tagtattgct caatgtcacg aaagcatatg ctggcagtta cacctgccct ggagatgatg 29460 

ccgacaatat gattttttac aaagtggaag tggttgatcc cactactcca ccgcccacca 29520 

ccacaactac tcataccaca cacacagaac aaacaccaga ggcagcagaa gcagagttgg 29580 

ccttccaggt tcacggagat tcctttgctg tcaatacccc tacacccgat cagcggtgtc 29640 

cggggctgct cgtcagcggc attgtcggtg tgctttcggg attagcagtc ataatcatct 29700 

gcatgttcat ttttgcttgc tgctatagaa ggctttaccg acaaaaatca gacccactgc 29760 

tgaacctcta tgtttaattt tttccagagc catgaaggca gttagcgctc tagttttttg 29820 

ttc ttt ga tt ggcattgttt.ttagtgctgg.gtttttgaaa aatcttacca t.ttatgaagg 29880 

tgagaatgcc actctagtgg gcatcagtgg tcaaaatgtc agctggctaa aataccatct 29940 

agatgggtgg aaagacattt gcgattggaa tgccactgCg tatacatgta atggagtcaa 30000 

cctcaccatt actaatgcca cccaagatca gaatggtagg tttaagggcc agagcttcac 30060 

tagaaataat gggtatgaat cccataacat gtttatotat gacgtcactg tcatcagaaa 30120 

tgagactgcc accaccacac agatgcccac tacacacagt tctaccacta ctaccatgca 30180 
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aaccacacag acaaccacta catcaactca gcatatgacc accactacag cagcaaagcc 30240 

aagtagtgca gcgcctcagc cccaggcttt ggctttgaaa gctgcacaac ctagtacaac 30300 

tactaggacc aatgagcaga ctactgaatt tttgtccact gtcgagagcc acaccacagc 30360 

tacctccagt gccttctcta gcaccgccaa tctctcctcg ctttcctcta caccaatcag 30420 

tcccgctact actcccaccc cagctcttct ccccactccc ctgaagcaaa ctgaggacag 30480 

cggcatgcaa tggcagatca ccctgctcat tgtgatcggg ttggtcatcc tggccgtgtt 3054 0 

gctctactac atcttctgcc gccgcattcc caacgcgcac cgcaaaccgg cctacaagcc 30600 

catcgttatc gggcagccgg agccgcttca ggtggaaggg ggtctaagga atcttctctt 30660 

ctcttttaca gtatggtgat tgaactatga ttcctagaca attcttgatc actattctta 30720 

tctgcctcct ccaagtctgt gccaccctcg ctctggtggc caacgccagt ccagactgta 30780 

ttgggccctt cgcctcctac gtgctctttg ccttcatcac ctgcatctgc tgctgtagca 30840 

tagtctgcct gcttatcacc ttcttccagt tcattgactg gatctttgtg cgcatcgcct 30900 

acctgcgcca ccacccccag taccgcgacc agcgagtggc gcggctgctc aggctcctct 30960 

gataagcatg cgggctctgc tacttctcgc gcttctgctg ttagtgctcc cccgccccgt 3X020 

cgacccccgg tcccccactc agtcccccga agaggtccgc aaatgcaaat tccaagaacc 31080 

ctggaaattc ctcaaatgct accgccaaaa atcagacatg cttcccagct ggatcatgat 31140 

cattgggatc gtgaacattc tggcctgcac cctcatctcc tttgtgattt acccctgctt 31200 

tgactttggt tggaactcgc cagaggcgct ctatctcccg cctgaacctg acacaccacc 31260 

acagcaacct caggcacacg cactaccacc accacagcct aggccacaat acatgcccat 31320 

attagactat gaggccgagc cacagcgacc catgctcccc gctattagtt acttcaatct 31380 

aaccggcgga gatgactgac ccactggcca acaacaacgt caacgacctt ctcctggaca 31440 

tggacggccg cgcctcggag cagcgactcg cccaacttcg cattcgccag cagcaggaga 31500 

gagccgtcaa ggagctgcag gacggcatag ccatccacca gtgcaagaaa ggcatcttct. 31560 

gcctggtgaa acaggccaag atctcctacg aggtcacccc gaccgaccat cgcctctcct 31620 

acgagctcct gcagcagcgc cagaagttca cctgcctggt cggagtcaac cccatcgtca 31680 

tcacccagca gtcgggcgat accaaggggt gcatccactg ctcctgcgac tcccccgact 31740 

gcgtccacac tctgatcaag accctctgcg gcctccgcga cctcctcccc atgaactaat 31800 

caccccctta tccagtgaaa taaatatcat attgatgatg atttaaataa aaaataatca 31860 
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tt tg attt 9 a aataaa.ata caatcatatt gatgatttga gttttaaaaa ^ 

acttacttsa aa tctgatac caggtctctg ^ 

cectcttecc a 9 ctc tggta ctgcagaccc cggcgggctg ^ 

aa 9ggg a tgt caaattcctc ctstccctca atcttcattt tatcttcta , cagatgtcca . 321Q0 

aaaa g c gcgt cc gggtggat gatgacttcg accccgtcfca ^ 

c a cc gaccgt 3 cco tt ca tc aaccccccct tcgtctcttc agatggattc caagagaagc ^ 

ccc tgggggt gctgtccctg cgactggctg accccgtcac ^^^^ ^^^^ 322flo 

ccc t caa gct gggagagggg gtggacctcg actcctcggg aaaacfccatc tccaacacgg ^ 

ccacc aaggc cgccgcccct ctcagtttfct ccaacaacac ^^^^^ aacatggata 324oq 

cccctc ttta taccaaagat gg aaa attat cc tt aca agt ttctccaccg ttaaaca tat 32460 

taa aatca ac ca tt c tgaac aca ttagctg tagcttatgg atcaggttfca ^ 

9 t33 c a c tgc tc ttgca9ta cagttggcct ctccactcac ttttgatgag aaaggaaata 325oo 

t ta aaattaa cc tagC c agC ggtC c attaa cagttgatgc aagccgactt agtatcaact 3264o 

9Caaaagagg ggtcactgtc actacctcag gagatgcaat tgaaagcaac ataagctggc 327oo 

c taaaggtat aagatttgaa ggtaatggca tagctgcaaa cafctggcaga ggafctggaat 32?6o 

t tggaa cc ac tagtacagag actgatgfcca cagatgcata cccaafctcaa gttaaattgg , 3282o 

9tac tggcct taC c tttgac agtacaggcg ccattgttgc ttggaacaaa gaggatgata 3288o 

aac ttaC a tt atggaccaca gccgacccct cgccaaatCg caaaatatac tcCgaaaaag ^ 

a tg cc aaact cacac tttgc ttgacaaagt gtggaagtca aattctgggt ^ 

ta Ctg3 ca g c gaataatgga agtctcaacc caatcacaaa cacagtaagc ac . tgcactcg 33oeo 

tc t ccc tCa a gt tt gatgC a a gtggagttt tgctaagcag ctccacatta gacaaagaat ^ 

att ggaactt Cagaaaggga gatgttacac ctgctgagcc ctatacfcaat gctafcaggtt 33ieo 

ttat9CCtaa Cataaa ^ cc tatcctaa aa acacatctgc agctt c aaaa agC c atattg 33240 

t c agtcaagt ttatotcaat gg99atgagg cc aaaC c act gatgctgatC actac tttt a. 33300 

at ga a actga ggatgcaact tgcacctaca gtatcactfct tcaatggaaa tgggatagfca 333eo 

ct aagtaca c aggtgaaaca cttgctacca gctccttcac cttctcctac atcgcccaag 3342Q 

aatgaacact gtatcccacc ct g c atg cca .cccttccca cccc act c tg tctatggaaa 33400 

aaactctgaa gcacaaaata aaataaa gtt ca agtgtttt attgattcaa cagttttaca 33540 

g9 att cgagc agttattttt cctccaccct cccaggacat ggaatacacc accctctccc 336Qo 
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cccgcacagc cttgaacatc tgaatgccat tggtgatgga catgcttttg gcctccacgt 33660 

tccacacagt ttcagagcga gccagtctcg ggtcggtcag ggagatgaaa ccctccgggc 33720 

actcccgcat ctgcacctca cagctcaaca gctgaggatt gtcctcggtg gtcgggatca 33780 

cggttatctg gaagaagcag aagagcggcg gtgggaatca tagtccgcga acgggatcgg 33840 

ccggtggtgt cgcatcaggc cccgcagcag tcgctgccgc cgccgctccg tcaagctgct 33900 

gctcaggggg tccgggtcca gggactccct cagcatgatg cccacggccc tcagcatcag 33960 

tcgtctggtg cggcgggcgc agcagcgcat gcggatctcg ctcaggtcgc tgcagtacgt 34020 

gcaacacagg accaccaggt tgttcaacag tccatagttc aacacgctcc agccgaaact 34080 

catcgcggga aggatgctac ccacgtggcc gtcgtaccag atcctcaggt aaatcaagtg 34140 

gcgctccctc cagaacacgc tgcccacgta catgatctcc ttgggcatgt ggcggttcac 34200 

cacctcccgg taccacatca ccctctggtt gaacatgcag ccccggatga tcctgcggaa 34260 

ccacagggcc agcaccgccc cgcccgccat gcagcgaaga gaccccgggt cccggcaatg 34320 

gcaatggagg acccaccgct cgtacccgtg gatcatctgg gagctgaaca agtctatgtt 34380 

ggcacagcac aggcatatgc tcatgcatct cttcagcact ctcagctcct cgggggtcaa 34440 

aaccatatcc cagggcacgg ggaactcttg caggacagcg aaccccgcag aacagggcaa 34500 

tcctcgcaca taacttacat tgtgcatgga cagggtatcg caatcaggca gcaccgggtg 34560 

atcctccacc agagaagcgc gggtctcggt ctcctcacag cgtggtaagg gggccggccg 34620 

atacgggtga tggcgggacg cggctgatcg tgttcgcgac cgtgtcatga tgcagttgct 34680 

ttcggacatt ttcgtacttg ctgtagcaga acctggtccg ggcgctgcac accgatcgcc 34740 

ggcggcggtc ccggcgcttg gaacgctcgg tgttgaaatt gtaaaacagc cactctctca 34800 

gaccgtgcag cagatctagg gcctcaggag tgatgaagat cccatcatgc ctgatagctc 34860 

tgatcacatc gaccaccgtg gaatgggcca gacccagcca gatgatgcaa ttttgttggg 34920 

tttcggtgac ggcgggggag ggaagaacag gaagaaccat gattaacttt taatccaaac 34980 

ggtctcggag cacttcaaaa tgaaggtcgc ggagatggca cctctcgccc ccgctgtgtt 35040 

ggtggaaaat aacagccagg tcaaaggtga tacggttctc gagatgttcc acggtggctt 35100 

ccagcaaagc ctccacgcgc acatccagaa acaagacaat agcgaaagcg ggagggttct 35160 

ctaattcctc aatcatcatg ttacactcct gcaccatccc cagataattt tcatttttcc 35220 

agccttgaat gattcgaact agttcctgag gtaaatccaa gccagccatg ataaagagct 35280 
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cgcgcagagc gccctccacc ggcattctta agcacaccct cataattcca a 9atattctg 35 340 

c t cct 99ttc acc t9 ca 9 ca 9 a tt9 acaa 9 c 99 aa t a tC a aaatctctgc c 9 c 9 atccc t 35400 

aa 9 ctcctcc c t ca 9 caata actgtaagta ctctttcata tcc t c tC c 9 a aatttttagc 35460 

cata 99 acca ccaggaataa gattagggca agccacagta cagataaacc gaagtcctcc 35520 

ccagt 9 a 9 ca ttgccaaatg caagac tgct ataagcatgc t 93 c t agacc cggtgatatc 35580 

ttccagataa c t ggacagaa aatcacccag gcaa ttttta agaaaatcaa caaaa 9 aaaa 35640 

atcctccagg tgcacgtcta gagcctcggg aacaacgatg aagtaaatgc aagcggtgcg 35700 

ttccagcatg g ttagttagc tgatctgtaa aaaacaaaaa ^ 

agcctggcga ac a g 9tgg9t aaatcgttct ctccagcacc aggcagg<=ca cggggtcfccc ^ 

ggcgcgaccc tcgtaaaaat tgtcgctatg attgaaaacc atcacagaga . gacgttcccg 35880 

9tggccggc 9 tgaatgattc gacaagatga atacaccccc gg aacattgg cgtccg cgag 35940 

tgaaaaaaag cgcccgagg a agcaataagg cactacaatg ctcagtctca agtccagcaa 36000 

agcgatgcca tgcg 9atgaa gcacaaaatc ctc aggtgcg t.caaaatgt aattactccc 36060 

ctcctgcaca ggcagcgaag ccccc gatC c ctccagatac acatac aaag cctc ag c 9 tc 36 120 

catagcttac cgagcagcag cacacaacag gcgcaagagt cagagaaagg ctgagctcta. 36X80 

acctgcccac ccgctc.ctg c tCaatatat agcccagatc tacactg.cg taaaggcc.a 36240 

agcctaaaaa tacccgccaa ataatcacac acgcccagca cacgcccaga aaccggtgac 36300 

acactcaaaa aaatacgcgc acttcctcaa acgcccaaac t g cc gt ca tt t ccggg ttC c 36360 

cacgctacgt catc gg aat t c g ac tttcaa attcc g tc g a ccg ttaaaaa cgtcacccgc 3 6420 

cccgccccta acggtcgccc gtctctcggc caatcacctt cctccctccc caaattcaaa 36480 
cagctcattt gcatat taac gcgcaccaaa a 9 t ttgaggt atattattga tgatg 



36535 



<2X0> 4 

<211> 34264 

<212> DNA 

<213> simian adenovirus SV-l 

<400> 4 



tcctt at tct gga aa c 9tg c c aata tg ata atgagcgggg agg agcgagg c 9ggg cc 999 
gtgacgtgcg gtgacgtggg gtga cgcggg gtggcgcgag ggc ggggcgg g a g tggggag 
9 cgctt a9tt tttac gtat g cgg aaggagg tttt a t acc g gaagt t9ggt aatttgggcg 
tatacttgta agttttgtgt aatttggcgc gaaaaccggg taatgaggaa gttgaggtta 



60 
120 
180 
240 
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atatgtactt tttatgactg 9 gcg 9 aattt 


ctgctgatca 


gcagtgaact 


ttgggcgctg 


300 


acggggaggt ttcgctacgt ggcagtacca 


cgagaaggct 


caaaggtccc 


atttattgta 


360 


ctcctcagcg ttttcgctgg gtatttaaac 


gctgtcagat 


catcaagagg 


ccactcttga 


420 


gtgccggcga gtagagtttt ctcctccgcg 


ctgCcgcgat 


gaggctggtt 


cccgagatgt 


480 


acggtgtttt ctgcagcgag acggcccgga 


actcagatga 


gctgcttaat 


acagatctgc 


540 


tggatgttcc caactcgcct gtggcttcgc 


ctccgtcgct 


tcatgatctt 


ttcgatgtgg 


600 


aagtggatcc accgcaagat cccaacgagg 


acgcggtaaa 


cagtatgttc 


cctgaatgtc 


660 


tgtttgaggc ggctgaggag ggttctcaca 


gcagtgaaga 


gagcagacgg 


ggagaggaac 


720 


tggacttgaa atgctacgag gaatgtctgc 


cttctagcga 


ttctgaaacg 


gaacagacag 


780 


ggggagacgg ctgtgagtcg gcaatgaaaa 


atgaacttgt 


attagactgt 


ccagaacatc 


840 


ctggtcatgg ctgccgtgcc tgtgcttttc 


atagaaatgc 


cagcggaaat 


cctgagactc 


900 


tatgtgctct gtgttatctg cgccttacca 


gcgattttgt 


atacagtaag 


taaagtgttt 


960 


tcattggcgt acggtagggg attcgttgaa 


gtgctttgtg 


acttattatg 


tgtcattatt 


1020 


tctaggtgac gtgtccgacg tggaagggga 


aggagataga 


tcaggggctg 


ctaattctcc 


1080 


ttgcactttg ggggctgtgg ttccagttgg 


catttttaaa 


ccgagtggtg 


gaggagaacg 


1140 


agccggagga gaccgagaat ctgagagccg 


gcctggaccc 


tccagtggaa 


gactaggtgc 


1200 


tgaggatgat cctgaagagg ggactagtgg 


gggtgctagg 


aaaaagcaaa 


aaactgagcc 


1260 


tgaacctaga aactttttga atgagttgac 


tgtaagccta 


atgaatcggc 


agcgtcctga 


1320 


gacggtgttt tggactgagt tggaggatga 


gttcaagaag 


ggggaattaa 


acctcttgta 


1380 


caagtatggg tttgagcagt tgaaaactca 


ctggttggag 


cc gtgggagg 


atatggaaat 


1440 


ggctctagac acctttgcta aagtggctct 


gcggccggat 


aaagtttaca 


ctattcgccg 


1500 


cactgttaat ataaaaaaga gtgtttatgt 


tatcggccat 


ggagctctgg 


tgcaggtgca 


1560 


gaccccagac cgggtggctt tcaattgcgg 


catgcagagt 


ttgggccccg 


gggtgatagg 


1620 


tttgaatgga gttacatttc aaaatgtcag 


gtttactggt 


gatgatttta 


atggctctgt 


1680 


gtttgtgact agcacccagc taaccctcca 


cggtgtttac 


ttttttaact 


ttaacaatac 


1740 


atgtgtggag tcatggggta gggtgtctct 


gaggggctgc 


agttttcatg 


gttgctggaa 


1800 


ggcggtggtg ggaagaatta aaagtgtcat 


gtctgtgaag 


aaatgcatat 


ttgaacgctg 


1860 


tgtgatagct ctagcagtag aggggtacgg 


acggatcagg 


aataacgccg 


catctgagaa 


1920 


tggatgtttt cttttgctga aaggtacggc 


cagcgttaag 


cataatatga 


tttgcggcag 


1980 
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cggcctgtgc ccctcgcagc tcttaacttg cgcagatgga aactgtcaca ccttgcgcac 2040 

cgtgcacata gtgtcccact cgcgccgcac ctggccaaca tttgagcaca atatgctcat 2100 

gcgttgcgcc gttcacctag gtgctagacg cggcgtgttt atgccttatc aatgtaactt 2160 

tagtcatact aagattttgc tggaaactga ttccttccct cgagtatgtt tcaatggggt 2220 

gtttgacatg tcaatggaac tttttaaagt gataagatat gatgaaacca agtctcgttg 2280 

tcgctcatgt gaatgcggag ctaatcattt gaggttgtat cctgtaaccc tgaacgttac 2340 

cgaggagctg aggacggacc accacatgct gtcttgcctg cgtaccgact atgaatccag 2400 

cgatgaggag tgaggtgagg ggcggagcca caaagggtat aaaggggcat gaggggtggg 2460 

cgcggtgttt caaaatgagc gggacgacgg acggcaatgc gtttgagggg ggagtgttca 2520 

gcccatatct gacatctcgt cttccttcct gggcaggagt tcgtcagaat gtagtgggct 2 580 

ccaccgtgga cggacggccg gtcgcccctg caaattccgc caccctcacc tatgccaccg 2640 

tgggatcatc gttggacact gccgcggcag ctgccgcttc tgctgccgct tctactgctc 2700 

gcggcatggc ggctgatttt ggactatata accaactggc cactgcagct gtggcgtctc 2760 

ggtctctggt tcaagaagat gccctgaatg tgatcttgac tcgcctggag. atcatgtcac 2820 

gtcgcctgga cgaactggct gcgcagatat cccaagctaa ccccgatacc. gcttcagaat 2880 

cttaaaataa agacaaacaa atttgttgaa aagtaaaatg gctttatttg ttttttttgg 2940 

ctcggtaggc tcgggtccac ctgtctcggt cgttaaggac tttgtgtatg ttttccaaaa 3000 

cacggtacag atgggcttgg atgttcaagt acatgggcat gaggccatct ttggggtgga 3 060 

gataggacca ctgaagagcg tcatgttccg gggtggtatt gtaaatcacc cagtcgtagc 3120 

agggtttttg agcgtggaac tggaatatgt ccttcaggag caggctaatg gccaagggta 3180 

gacccttagt gtaggtgttt acaaagcggt tgagctggga gggatgcatg cggggggaga 3240 

tgatatgcat cttggcttgg attttgaggt tagctatgtt accacccagg tctctgcggg 3300 

ggttcatgtt atgaaggacc accagcacgg tatagccagc gcatttgggg aacttgtcat 3360 

gcagtttgga ggggaaggcg tggaagaatt tagatacccc cttgtgcccc cctaggtttt 3420 

ccatgcactc atccataata atggcaatgg gacccctggc ggccgcttta gcaaacacgt 34 80 

tttgggggtt ggaaacatca tagctttgct ctagagtgag ctcatcatag gccatcttta 3540 

caaagcgggg taggagggtg cccgactggg ggatgatagt tccatctggg cctggagcgt 3600 

agttgccctc acagatctgc atctcccagg ccttaattcc cgaggggggg atcatgtcca 3660 
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cctggggggc 


gataaaaaac 


acggtttctg 


gcggggggtt 


aatgagctgg 


gtggaaagca 


3720 


agttacgcaa 


cagctgggat 


ttgccgcaac 


cggtgggacc 


gtagatgacc 


ccgatgacgg 


3780 


gttgcagctg 


gtagttcaga 


gaggaacagc 


tgccgtcggg 


gcgcaggagg 


ggagctacct 


3840 


cattcatcat 


gcttctgaca 


tgtttatttt 


cactcactaa 


gttttgcaag 


agcctctccc 


3900 


cacccaggga 


taagagttct 


tccaggctgt 


tgaagtgttt 


cagcggtttc 


aggccgtcgg 


3960 


ccatgggcat 


cttttcaagc 


gactgacgaa 


gcaagtacag 


tcggtcccag 


agctcggtga 


4020 


cgtgctctat 


ggaatctcga 


tccagcagac 


ttcttggttt 


cgggggttgg 


gccgactttc 


4080 


gctgtagggc 


accagccggt 


gggcgtccag 


ggccgcgagg 


gttctgtcct 


tccagggtct 


4140 


cagcgttcgg 


gtgagggtgg 


tctcggtgac 


ggtgaaggga 


tgagccccgg 


gctgggcgct 


4200 


tgcgagggtg 


cgcttcaggc 


tcatcctgct 


ggtgctgaag 


cgggcgtcgt 


ctccctgtga 


4260 


gtcggccaga 


tagcaacgaa 


gcatgaggtc 


gtagctgagg 


gactcggccg 


cgtgtccctt 


4320 


ggcgcgcagc 


tttcccttgg 


aaacgtgctg 


acatttggtg 


cagtgcagac 


acttgagggc 


4380 


gtagagtttt 


ggggccagga 


agaccgactc 


gggcgagtag 


gcgtcggctc 


cgcactgagc 


4440 


gcagacggtc 


tcgcactcca 


ccagccacgt 


gagctcgggt 


ttagcgggat 


caaaaaccaa 


4500 


gttgcctcca 


ttttttttga 


tgcgtttctt 


accttgcgtc 


tccatgagtc 


tgtgtcccgc 


4560 


ttccgtgaca 


aaaaggctgt 


cggtatcccc 


gtagaccgac 


ttgagggggc 


gatcttccaa 


4620 


aggtgttccg 


aggtcttccg 


cgtacaggaa 


ctgggaccac 


tccgagacaa 


aggctcgggt 


4680 


ccaggctaac 


acgaaggagg 


cgatctgcga 


ggggtatctg 


tcgttttcaa 


tgagggggtc 


4740 


caccttttcc 


a 999tgtgca 


gacacaggtc 


gtcctcctcc 


gcgtccacga 


aggtgattgg 


4800 


cttgtaagtg 


taggtcacgt 


gacccgcacc 


cccccaaggg 


gtataaaagg 


9ggcgtgccc 


4860 


actctccccg 


tcactttctt 


ccgcatcgct 


gtggaccaga 


gccagctgtt 


cgggtgagta 


4920 


ggccctctca 


aaagccggca 


tgatttcggc 


gctcaagttg 


tcagtttcta 


caaacgaggt 


4980 


ggatttgata 


ttcacgtgcc 


ccgcggcgat 


gcttttgatg 


gtggaggggt 


ccatctgatc 


5040 


agaaaacacg 


atctttttat 


tgtcaagttt 


ggtggcgaaa 


gacccgtaga 


gggcgttgga 


5.100 


aagcaacttg 


gcgatggagc 


gcagggtctg 


atttttctcc 


cgatcggccc 


tctccttggc 


5160 


ggcgatgttg 


agttgcacgt 


actcgcgggc 


cacgcaccgc 


cactcgggga 


acacggcggt 


5220 


gcgctcgtcg 


ggcaggatgc 


gcacgcgcca 


gccgcggttg 


tgcagggtga 


tgaggtccac 


5280 


gctggtggcc 


acctccccgc 


ggaggggctc 


gttggtccaa 


cacaatcgcc 


ccccttttct 


5340 


ggagcagaac 


ggaggcaggg 


gatctagcaa 


gttggcgggc 


ggggggtcgg 


cgtcgatggt 


5400 
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aaatatgccg 


ggtagcagaa 


ttttattaaa ataatcgatt 


tcggtgtccg tgtcttgcaa 


5460 


cgcgtcttcc 


each hrht-^a 


ccgccagggc ectttegtag 


ggattcaggg gcggtcccca 


5520 




^aggecaggg 


ccgaggcgta catgccgcag 


atgtcgtaca cgtacagggg 


55B0 


CtCCCtcaar 


dccccgatgc 


aagtggggta acagcgcccc 


ccgcggatgc tggctcgcac . 


5640 


Qtaatcatac 


aLctcg cgag 


agggagecat gagcccgtct 


cccaagtggg tcttgtgggg 


5700 


v» *— v. i» >— y y v> 


c 99tagagga 


tetgectgaa gatggcgtgg 


gagttggaag agatagtggg 


5760 




acgttaaagt 


tggctccggg cagtcccacg 


gagtcttgga tgaactgggc 


5820 




c ggagcttgt 


ccaccagggc tgcggttacc 


ageaegtega gagegcagta 


5880 


y cccaacgcc 


tcgcggacca 


ggttgtaggc cgtctcttgt 


tttttctccc acagttcgcg 


5940 


accgaggagg 


tattcctcgc 


ggtccttcca gtactcttcg 


gegggaaate etttttegtc 


6000 


cgctcggtaa 


gaacctaaca 


tgtaaaattc gttcaegget 


ttgtatggac aacagecttt 


6060 


ttctaccggc 


agggegtacg 


ettgagegge ctttctgaga 


gaggtgtggg tgagggcgaa 


6120 


ggtgtcccgc 


accatcactt 


tcaggcactg atgtttgaag 


tccgtgtcgt cgcaggcgcc 


6180 


c ty ctcccac 


agcgtgaagt 


eggtgegett ttcctgcctg 


ggattgggga gggcgaatgt 


6240 




aagaggattt 


tcccggcgcg gggcatgaag 


ttgegagaga tcctgaaggg 


6300 


t-uwyyycacg 


tecgageggt 


tgetgatgae ttgcgccgcc 


aggacgatct cgtcgaagcc 


6360 




tggcccacga 


tgtaaagttc gataaagege 


ggctgtccct tgagggcegg 


6420 


cgctCCCCCC 


aactcctcgt 


aggtgagaca gtceggegag 


gagagaccca gctccgcccg 


6480 


yyt-ccag c eg 


gagagctgag 


ggttagccgc gaggaaagag 


ctccacaggt caagggctag 


6540 


cagagtttgc 


aageggtege 


ggaactcgcg aaactttttc 


cccacggcca ttttctcegg 


6600 


cgtcaccacg 


tagaaagtgc 


aggggeggtc gttccagacg 


tcccatcgga gctctagggc 


6660 


cagcccgcag 


gcttgacgaa 


cgagggtctc ctcgcccgag 


acgtgcatga ccagcatgaa 


6720 


yygcaccaac 


tgtttcccga 


acgagcccat ccatgtgtag 


gtttctacgt cgtaggtgac 


6780 


aaagagcege 


tgggtgcgcg 


cgtgggagcc gategggaag 


aagctgatct cctgccacca 


■ 6840 


gttggaggaa 


tgggtgttga 


tgtggtgaaa gtagaagtcc 


cgccggcgca cagagcattc 


6900 


gtgctgatgt 


ttgtaaaagc 


gaeegcagta gtcgcagcgc 


tgcacgctct gtatctcctg 


6960 


aatgagatgc 


gcttttcgcc 


cgcgcaccag aaaceggagg 


gggaagttga gaeggggget 


7020 


tggtggggcg 


gcatcccctt 


cgccttggcg gtgggagtct 


gcgtctgcgc cctccttctc 


7080 
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aacac ggcgg tgcgctcgtc g 9g ca gg at g cg cac g c g cc a g cc gcg a tt gtgcagggt9 
at g a ggtC ca c gctggtagc cacctccccg cggaggggcC cgttggtcca acacaatcgc 
ccccc ttttc tggagcagaa cgga3gcagg ggatctagca ^^^^ 

9c gt c g at gg tgaagatacc gggtagcag9 atcttattaa aataatcgat ttcggtgtcc 
gtg tc ttg ca acgcgtcttc ccacttcttc acc g cca ggg ccctttcgta gggattcagg 
g3 c 3gt cccc a gggcatggg gtgggtcagg gccgaggcgt acatgccgca gatgtcatac 
ac gt aca ggg gttccctcaa caccccgatg taagtggggt aacagcgccc cccgcggatg 

ct gg c t c 9 ca c gtag tc gta catc t c gcgc g a gggag cca t gaggC c gtc t cccaa gtgg 
9tct tgtggg gtttttcggc ccggtagagg atctgtctga agatggcgtg ^^^^ 

3 a 9 a tggtgg ggcgCtggaa gacgttaaag ttggccccgg gtagtcccac ggagtcttgg 
at 9 aac tggg c gtaggattc C c ggagtttg tccaccaggg cggcggtcac cagcacgtcg 

a9a9e9M9t a9tCCaaC 9t ct <=9c 93 acc a 9gttgtagg C c gtc tc Ctg ttttttctcc 
cacagttcgc ggttgaggag . gtattcctcg cggtctttcc agtactcttc ggcgggaaat 

cccttttcgt cc gctCggta agaaC ctaac atgtaaaatt cgttcaccgc tttgtatgga 
caaca 9 cctt tttctaccgg ca 9gg c gtac 9 c ttgagcgg cc tttctgag a 9 a 9gtgtgg 
9 t 9 a gggc9 a a ggtgtccc9taccaccact ttcaggtact gatgtttgaa gtccgfcgtcg 
tcgcaggcgc cct gttC cca ca gcgtg aa 9 t c 9gtgcg c t ttttctgcct ggg att gggg 
a gggCgaagg tgacatcgtt aaagagtatc ttcccggcgc ggggcatgaa gttgcgagag 
atcct gaagg 9C cc 9gg c a c 9 tcc 9 a 9 c 9g ttgttg-tg. cct 9 c 9 cc gc ca ggacgatc 
tc g tc 9 aa gc c gttgatgtt gtgacccacg atgtaaagtt cgatgaagcg cggctgtccc 
tt9 a ggg cc g gc9 c tttttt caac t cctc 9 ta ggtgagac agtccggcga ggagagaccc 
a 9 ctc a9 occ 993 ccca 9t c g9aga9ttga 9gattagccg caaggaagga gctccataga 
tccaaggcca ggagagtttg caa gcggt c 9 c gg aa Ctcg c gg aac ttttt ccccacggcc 
atttcctccg gt9 tcac t ac gt aaaa ggtg tt 9gggcggt . tgttccacac gtcccatcgg 
a gct cta ggg ccagctcgca 9g ctt gg c ga a c g a 9gg tc t cctcgccaga gacgtgcatg 
accagcataa a gggtacC aa ct gttt ccc g aacgagccca tccat gtgta ggtttctacg 
tc 9 ta ggtga caaagagccg ct gggtgcg c g c gtgggagc cgatcggaaa gaagctgatc 
tcct 9 cc a cc a 9 c tgga9ga atgggtgtta atgtggtgga agtagaagtc cdgccggcgc 
acagagcatt cgtgctgatg tttgtaaaag cgaccgcagt agtcgcagcg ctgcacgctc 
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tgtatctcct 


gaacgagatg 


cgcttttcgc 


ccgcgcacca 


gaaaccggag 


ggggaagttg 


6840 


agacgggggg 


ctggtggggc 


gacatcccct 


tcgccttggc 


ggtgggagtc 


tgcgtctgcg 


6900 


tcctccttct 


ctgggtggac 


gacggtgggg 


acgacgacgc 


cccgggtgcc 


gcaagtccag 


6960 


atctccgcca 


cggaggggtg 


caggcgctgc 


aggaggggac 


gcagctgccc 


gctgtccagg 


7020 


gagtcgaggg 


aagtcgcgct 


gaggtcggcg 


ggaagcgttt 


gcaagttcac 


tttcagaaga 


7080 


ccggtaagag 


cgtgagccag 


gtgcagatgg 


tacttgattt 


ccaggggggt 


gttggatgaa 


7140 


gcgtccacgg 


cgtagaggag 


tccgtgtccg 


cgcggggcca 


ccaccgtgcc 


ccgaggaggt 


7200 


tttatctcac 


tcgtcgaggg 


cgagcgccgg 


ggggtagagg 


cggctctgcg 


ccggggggca 


7260 


gcggaggcag 


aggcacgttt 


tcgtgaggat 


tcggcagcgg 


ttgatgacga 


gcccggagac 


7320 


tgctggcgtg 


ggcgacgacg 


cggcggttga 


ggtcctggat 


gtgccgtctc 


tgcgtgaaga 


7380 


ccaccggccc 


ccgggtcctg 


aacctaaaga 


gagttccaca 


gaatcaatgt 


ctgcatcgtt 


7440 


aacggcggcc 


tgcctgagga 


tctcctgcac 


gtcgcccgag 


ttgtcctgat 


aggcgatctc 


7500 


ggccatgaac 


tgttccactt 


cttcctcgcg 


gaggtcaccg 


tggcccgctc 


gctccacggt 


7560 


ggcggccagg 


tcgttggaga 


tgcggcgcat 


gagttgagag 


aaggcgttga 


ggccgttctc 


7620 


gttccacacg 


cggctgtaca 


ccacgtttcc 


gaaggagtcg 


cgcgctcgca 


tgaccacctg 


7680 


ggccacgttg 


agttccacgt 


ggcgggcgaa 


gacggcgtag 


tttctgaggc 


gctggaagag 


7740 


gtagttgagc 


9tggtggcga 


tgtgctcgca 


gacgaagaag 


tacataatcc 


agcgccgcag 


7800 


ggtcatctcg 


ttgatgtctc 


cgatggcttc 


gagacgctcc 


atggcctcgt 


agaagtcgac 


7860 


ggcgaagttg 


aaaaattggg 


agttgcgggc 


ggccaccgtg 


agttcttctt 


gcaggaggcg 


7920 


gatgagatcg 


gcgaccgtgt 


cgcgcacctc 


ctgttcgaaa 


gcgccccgag 


gcgcctctgc 


7980 


ttcttcctcc 


ggctcctcct 


cttccagggg 


ctcgggttcc 


tccggcagct 


ctgcgacggg 


8040 


gacggggcgg 


cgacgtcgtc 


gtctgaccgg 


caggcggtcc 


acgaagcgct 


cgatcatttc 


8100 


gccgcgccgg 


cgacgcatgg 


tctcggtgac 


ggcgcgtccg 


ttttcgcgag 


gtcgcagttc 


8160 


gaagacgccg 


ccgcgcagag 


cgcccccgtg 


cagggagggt 


aagtggttag 


ggccgtcggg 


8220 


cagggacacg 


gcgctgacga 


tgcattttat 


caattgctgc 


gtaggcactc 


cgtgcaggga 


8280 


tctgagaacg 


tcgaggtcga 


cgggatccga 


gaacttctct 


aggaaagcgt 


ctatccaatc 


8340 


gcaatcgcaa 


ggtaagctga 


gaacggtggg 


tcgctggggg 


gcgttcgcgg 


gcagttggga 


8400 


ggtgatgctg 


ctgatgatgt 


aattaaagta 


ggcggtcttc 


a ggcggcgga 


tggtggcgag 


8460 
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gaggaccacg 


tctttgggcc 


cggcctgttg 


aatgcgcagg 


cgctcggcca 


tgccccaggc 


B520 


ctcgctctga 


cagcgacgca 


ggtctttgta 


gaagtcttgc 


atcagcctct 


ccaccggaac 


8530 


ctctgcttct 


cccctgtctg 


ccatgcgagt 


cgagccgaac 


ccccgcaggg 


gctgcagcaa 


8640 


cgctaggtcg 


gccacgaccc 


tttcggccag 


cacggcctgt 


tgaatctgcg 


tgagggtggc 


8700 


ctggaagtcg 


tccaggtcca 


cgaagcggtg 


ataggccccc 


gtgttgatgg 


tgtaggtgca 


8760 


gttggccatg 


acggaccagt 


tgacgacttg 


catgccgggt 


tgggtgatct 


ccgtgtactt 


8820 


gaggcgcgag 


taggccctgg 


actcgaacac 


gtagtcgttg 


catgtgcgca 


ccagatactg 


8880 


gtagccgacc 


aggaagtgag 


gaggcggctc 


tcggtacagg 


ggccagccaa 


cqgtgqcqqq 


8940 


ggcgccgggg 


gacaggtcgt 


ccagcatgag 


gcggtggtag 


tggtagatgt 


agcggqaqaq 


9000 


ccaggtgatg 


ccggccgagg 


tggttgcggc 


cctggtgaat 


tcgcggacgc 


ggttccagat 


9060 


gttgcgcagg 


ggaccaaagc 


gctccatggt 


gggcacgctc 


tgccccgtga 


ggcgggcgca 


9120 


atcttgtacg 


ctctagatgg 


aaaaaagaca 


gggcggtcat 


cgactccttt 


ccgtagcttg 


91B0 


gggggtaaag 


tcgcaagggt 


gcggcggcgg 


ggaaccccgg 


ttcgagaccg 


qccqqatccq 


9240 


ccgctcccga 


tgcgcctggc 


cccgcatcca 


cgacgtccgc 


gccgagaccc 


aqccqcqacQ 


9300 


ctccgcccca 


atacggaggg 


gagtcttttg 


gtgttttttc 


gtagatgcat 


CCQQtQCtQC 


9360 


ggcagatgcg 


accccagacg 


cccactacca 


ccgccgtggc 


ggcagtaaac 


ctqaqcqqaq 


9420 


gcggtgacag 


ggaggaggaa 


gagctggctt 


tagacctgga 


agagggagag 


gggctqqccc 


9480 


ggctgggagc 


gccatcccca 


gagagacacc 


ctagggttca 


gctcgtgaqq 


cjacqccaaac 


9540 


aggcttttgt 


gccgaagcag 


aacctgttta 


gggaccgcag 


cggtcaqqaq 


QCQaaaaaaa 


9600 


tgcgcgattg 


caggtttcgg 


gcgggcagag 


agctcagggc 


gggcttcgat 


CQqaaacaac 


9660 


tcctgagggc 


ggaggatttc 


gagcccgacg 


agcgttctgg 


ggtgagcccg 


gcccgcgctc 


9720 


acgtatcggc 


ggccaacctg 


gtgagcgcgt 


acgagcagac 


ggtgaacgag 


gagcgcaact 


9780 


tccaaaagag 


ctttaacaat 


cacgtgagga 


ccctgatcgc 


gagggaggag 


gtgaccatcg 


9840 


ggctgatgca 


tctgtgggac 


ttcgtggagg 


cctacgtgca 


gaacccggct 


agcaaacccc 


9900 


tgacggccca 


gctgttcctg 


atcgtgcagc 


acacicccrcaa 




t c. ccgcgacy 


yybu 


ccatgttgaa 


catcgcggag 


cccgagggtc 


gctggctctt 


ggatctgatt 


aacatcctgc 


10020 


agagcatcgt 


ggtgcaggag 


aggggcctga 


gtttagcgga 


caaggtggcg 


gccattaact 


10080 


attcgatgca 


gagcctgggg 


aagttctacg 


ctcgcaagat 


ctacaagagc 


ccttacgtgc 


10140 


ccatagacaa 


ggaggtgaag 


atagacagct 


tttacatgcg 


catggcgctg 


aaggtgctga 


10200 
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cgctgagcga 


cgacctcggc gtgtaccgta 


acgacaagat 


ccacaaggcg 


gtgagcgcca 


10260 


gccgccggcg 


ggagctgagc 


gacagggagc 


tgatgcacag 


cctgcagagg 


gcgctggcgg 


10320 


gcgccgggga 


cgaggagcgc 
» 


gaggcttact 


tcgacatggg 


agccgatctg 


cagtggcgtc 


10380 


ccagcgcgcg 


cgccttggag. gcggcgggtt 


atcccgacga 


ggaggatcgg 


gacgatttgg 


10.440 


aggaggcagg 


cgagtacgag gacgaagcct 


gaccgggcag 


gtgttgtttt 


agatgcagcg 


10500 


gccggcggac 


gggaccaccg cggatcccgc 


acttttggca 


tccatgcaga 


gtcaaccttc 


10560 


gggcgtgacc 


gcctccgatg actgggcggc 


ggccatggac 


cgcatcatgg 


cgctgaccac 


10620 


ccgcaacccc 


gaggctttta ggcagcaacc 


ccaggccaac 


cgtttttcgg 


ccatcttgga 


10680 


agcggtggtg 


ccgtcgcgca 


ccaacccgac 


gcacgagaaa 


gtcctgacta 


tcgtgaacgc 


10740 


cctggtagac 


agcaaggcca* 


tccgccgtga 


cgaggcgggc 


ttgatttaca 


acgctctttt 


10800 


ggaacgcgcg 


gcgcgctaca 


acagcactaa 


cgtgcagacc 


aatctggacc 


gcctcaccac 


10860 


cgacgtgaag 


gaggcgctgg 


cgcagaagga 


gcggtttctg 


agggacagta 


atctgggctc 


10920 


tctggtggca 


ctgaaegcct 


tcctgagctc 


acagccggcc 


aacgtgcccc 


gcgggcagga 


10980 


ggattacgtg 


agcttcatca 


gcgctctgag 


actgctggtg 


tccgaggtgc 


cccagagcga 


11040 


ggtgtaccag 


tctgggccgg 


attacttttt 


ccagacgtcc 


cgacagggct 


tgcaaacggc 


11100 


gaacctgact 


caggccttta 


aaaacttgca 


aggcatgtgg 


ggggtcaagg 


ccccggtggg 


11160 


cgatcgcgcc 


actatctcca 


gtctgctgac 


ccccaacact 


cgcctgctgc 


tgctcttgat 


11220 


cgcaccgttt 


accaacagta 


gcactatcag 


ccgtgactcg 


tacctgggtc 


atctcatcac 


11280 


tctgtaccgc 


gaggccatcg 


gccaggctca 


gatcgacgag 


catacgtatc 


aggagattac 


11340 


taacgtgagc 


cgtgccctgg 


gtcaggaaga 


taccggcagc 


ctggaagcca 


cgttgaactt 


11400 


tttgctaacc 


aaccggaggc 


aaaaaatacc 


ctcccagttc 


acgttaagcg 


ccgaggagga 


11460 


gaggattctg 


cgatacgtgc 


agcagtccgt 


gagcctgtac 


ttgatgcgcg 


agggcgccac 


11520 


cgcttccacg 


gctttagaca 


tgacggctcg 


gaacatggaa 


ccgtcctttt 


actccgccca 


11580 


ccggccgttc 


attaaccgtc 


tgatggacta 


cttccatcgc 


gcggccgcca 


tgaacgggga 


11640 


gtacttcacc 


aatgccatcc 


tgaatccgca 


ttggatgccc 


ccgtccggct 


tctacaccgg 


11700 


ggagtttgac 


ctgcccgaag 


ccgacgacgg 


ctttctgtgg 


gacgacgtgt 


ccgatagcat 


11760 


tttcacgccg 


gctaatcgcc 


gattccagaa 


gaaggagggc 


ggagacgagc 


tccccctctc 


11820 


cagcgtggaa 


gcggcctcaa 


ggggagagag 


tccctttcca 


agtctgtctt 


ccgccagtag 


11880 
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cggtcgggta 


acgcgtccac ggttgccggg ggagagcgac 




acccctcgct 


11940 


gcgaccggct 


agaaagaaaa attttcccaa taacqqqqtq 


yaddyctcgg 


tggataaaat 


12000 


gaatcgttgg 


aagacgtacg cccaggagca Qcqqqaataa 


ydyy<*c«gc.c 


agccgcggcc. 


12060 


gctggtaccg 


ccgcattggc gtcgccagag agaagacccg 


yauya*. LCCy 


cagacgatag 


12120 


tagcgtgttg 


gacctgggag ggagcggagc caaccccttt 


yvLuaL L tyc 


aacccaaggg 


12180 


gcgctcgagt 


cgcctgtatt aataaaaaaq acqcqqaaac 


f~ ^ ^ ^» ^ /«M 

u LaLvayayC 


catggccaca 


12240 


gcgtgtgtgc 


tttcttcctc tctttcttcc tcaacacaan 


agaacgagaa 


gageggtgag 


12300 


agtcacgccg 


gcggcgtatg agggcccgcc cccttcttac 


era a a t~t tr* *-» — » 

y aaay cgega 


tgggatcagc 


12360 


gaacgtgccg 


gccacgctgg aggcgcctta cqttcctccc 


dydcacccgg 


gaectaegga 


12420 


gggcagaaac 


agcatccgtt actccgagct ggcgcccctg 


taraa t - a r*» a 


ccaaggtgta 


12480 


cctggtggac 


aacaagtcgg cggacatcgc ctccctgaat 


tai,^,daaawy 


atcacagtaa 


12540 


ctttctgact 


accgtggtgc agaacaatga cttcaccccg 


d^yyaggegg 


gcacgcagac 


12600 


cattaacttt 


gacgagcgtt cccqctqqqq caotcaacta 


ddddCCaCCC 


tgcacaccaa 


12660 


catgcccaac 


atcaacgagt tcatqtccac caacaaat-tr 


agggctaagc 


fc 9 a tggtaga 


12720 


aaaaagtaat 


gcggaa'actc qqcaqccccq at accrual- on 


tccgagctta 


ccattccaga 


12780 


gggcaactat 


tccgaaacta tgactatcqa tctcataaat 


dacycgaccg 


tggacaatta 


12840 


cctgcaagtg 


gggagacaga acggggtgct qqaaaqcaat 


dccggcgcga 


aattcgatac 


12900 


cagaaacttc 


cgactggggt gggatcccgt gaccaagctg 




gcgtgtacac 


12960 


caacgaggct 


tttcacccgg acatcgtgct qctqccqqqa 


tyeggegegg 


acttcactca 


13020 


gagccgtttg 


agtaacctgt tagqaattaq aaaacarror 


cccccccaag 


agggctttca 


13080 


aatcatgtat 


gaggacctqq aqqqaqqtaa tafarrrnrr 


ttac tggacg 


tgtcgaagta 


13140 


cgaagctagc 


atacaacgcg ccaaaqcqqa oaataaaaaa 

w w w * v *Z3^"r33 t * yyy *-»yayciy 


atteggggag 


acacctttgc 


13200 


ggtagctccc 


caggacctgq aaataqtacc tthaarhaaa 


gacagcaaag 


acagaagcta 


13260 


caatattata 


aacaacacga cqqacaccct atatcaaaor 


cggctcctgg 


cttacaacta 


13320 


cggagacccc 


qaqaaaqqaq toaaahrahn ns^nsf'a^Hn 

j-jwwmyyjay wyaydtttttg yaCCoCaCCC 


accaccacgg 


acgtgacctg 


13380 


tggctcgcag 


caagtgtact ggtccctgcc ggatatgatg 


caagacccgg 


tcaccttccg 


13440 


cccctccacc 


caagtcagca acttcccggt ggtgggcacc 


gagctgetge 


ccgtccatgc 


13500 


caagagcttc 


tacaacgagc aggccgtcta ctcgcaactt 


attcgecagt 


ccaccgcgct 


13560 


tacccacgtg 


ttcaatcgct ttcccgagaa ccagattctg 


gtgcgccctc 


ccgctcctac 


13620 
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cattaccacc gtcagtgaaa acgttcccgc 


cctcacagat 


cacggaaccc 


tgccgctgcg 


13680 


cagcagtatc agtggagttc agcgcgtgac 


catcaccgac 


gccagacgtc 


gaacctgccc 


13740 


ctacgtttac aaagcgcttg gcgtggtggc 


tcctaaagtt 


ctttctagtc 


gcaccttcta 


13800 


aaaacatgtc catcctcatc tctcccgata 


acaacaccgg 


ctggggactg 


ggctccggca 


13860 


agatgtacgg cggagccaaa aggcgctcca 


gtcagcaccc 


agttcgagtt 


cggggccact 


13920 


tccgcgctcc ttggggagct tacaagcgag 


gactctcggg 


tcgaacggct 


gtagacgata 


13980 


ccatagatgc cgtgattgcc gacgcccgcc 


ggtacaaccc 


cggaccggtc 


gctagcgccg 


14040 


cctccaccgt ggattccgtg atcgacagcg 


tggtagccgg 


cgctcgggcc 


tatgctcgcc 


14100 


gcaagaggcg gctgcatcgg agacgtcgcc 


ccaccgccgc 


catgctggca 


gccagggccg 


14160 


tgctgaggcg ggcccggagg gcaggcagaa 


gggctatgcg 


ccgcgctgcc 


gccaacgccg 


14220 


ccgccgggag ggcccgccga caggctgccc 


gccaggctgc 


cgctgccatc 


gctagcatgg 


14280 


ccagacccag gagagggaac gtgtactggg 


tgcgtgattc 


tgtgacggga 


gtccgagtgc 


14340 


cggtgcgcag ccgacctccc cgaagttaga 


agatccaagc 


tgcgaagacg 


gcggtactga 


14400 


gtctccctgt tgttatcagc ccaacatgag 


caagcgcaag 


tttaaagaag 


aactgctgca 


14460 


gacgctggtg cctgagatct atggccctcc 


ggacgtgaag 


ccagacatta 


agccccgcga 


14520 


tatcaagcgt gttaaaaagc gggaaaagaa 


agaggaactc 


gcggtggtag 


acgatggcgg 


14580 


agtggaattt attaggagtt tcgccccgcg 


acgcagggtt 


caatggaaag 


ggcggcgggt 


14640 


acaacgcgtt ttgaggccgg gcaccgcggt 


agtttttacc 


ccgggagagc 


ggtcggccgt 


14700 


taggggtttc aaaaggcagt acgacgaggt 


gtacggcgac 


gaggacatat 


tggaacaggc 


14760 


ggctcaacag atcggagaat ttgcctacgg 


aaagcgttcg 


cgtcgcgaag 


acctggccat 


14620 


cgccttagac agcggcaacc . ccacgcccag 


cctcaaaccc 


gtgacgctgc 


agcaggtgct 


14880 


tcccgtgagc gccagcacgg acagcaagag 


ggggattaag 


agagaaatgg 


aagatctgca 


14940 


tcccaccatc caactcatgg tccctaaacg 


gcagaggctg 


gaagaggtcc 


tggagaagat 


15000. 


gaaagtggac cccagcatag agccggatgt 


aaaagtcaga 


cctattaagg 


aagtggcccc 


15060 


cggtcttggg gtgcaaacgg tggacattca 


aatccccgtc 


accaccgctt 


caaccgccgt 


15120 


ggaagctatg gaaacgcaaa cggagacccc 


tgccgcgatc 


ggtaccaggg 


aagtggcgtt 


15180 


gcaaacggag ccttggtacg aatacgcagc 


ccctcggcgt 


cagaggcgtt 


ccgctcgtta 


15240 


cggccccgcc aacgccatca tgccagaata 


tgcgctgcat 


ccgtctattc 


tgcccactcc 


15300 
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cggataccgg ggtgtgacgt atcgcccgtc tggaacccgc cgccgaaccc gtcgccgccg 15360 

ccgctcccgt cgcgctctgg cccccgtgtc ggtgcggcgt gtgacccgcc ggggaaagac 15420 

agtcgtcatt cccaacccgc gttaccaccc tagcatcctt taataactct gccgttttgc 15480 

agatggctct gacttgccgc gtgcgccttc ccgttccgca ctatcgagga agatctcgtc 15540 

gtaggagagg catgacgggc agtggtcgcc ggcgggcttt gcgcaggcgc atgaaaggcg 15600 

gaattttacc cgccctgata cccataattg ccgccgccat cggtgccata cccggcgttg 15660 

cttcagtggc gttgcaagca gctcgtaata aataaacaaa ggcttttgca cttatgacct 15720 

ggtcctgact attttatgca gaaagagcat ggaagacatc aattttacgt cgctggctcc 15780 

gcggcacggc tcgcggccgc tcatgggcac ctggaacgac atcggcacca gtcagctcaa 15840 

cgggggcgct ttcaattggg ggagcctttg gagcggcatt aaaaactttg gctccacgat 15900 

taaatcctac ggcagcaaag cctggaacag tagtgctggt cagatgctcc gagataaact 15960 

gaaggacacc aacttccaag aaaaagtggt caatggggtg gtgaccggca tccacggcgc 16020 

ggtagatctc gccaaccaag cggtgcagaa agagattgac aggcgtttgg aaagctcgcg 16080 

ggtgccgccg cagagagggg atgaggtgga ggtcgaggaa gtagaagtag aggaaaagct 16140 

gcccccgctg gagaaagttc ccggtgcgcc tccgagaccg cagaagcggc ccaggccaga .16200 

actagaagag actctggtga cggagagcaa ggagcctccc tcgtacgagc aag;ccttgaa 16260 

agagggcgcc tctccaccct cctacccgat gactaagccg atcgcaccca tggctcgacc 16320 

ggtgtacggc aaggattaca agcccgtcac gctagagctg cccccaccgc cccccacgcg 16380 

cccgaccgtc ccccccctgc cgactccgtc ggcggccgcg gcgggacccg tgtccgcacc 1644 0 

atccgctgtg cctctgccag ccgcccgtcc agtggccgtg gccactgcca gaaaccccag 16500 

aggccagaga ggagccaact ggcaaagcac gctgaacagc atcgtgggcc tgggagtgaa 16560 

aagcctgaaa cgccgccgtt gctattatta aaaaagtgta gctaaaaagt ctcccgttgt 16620 

atacgcctcc tatgttaccg ccagagacga gtgactgtcg ccgcgagcgc cgctttcaag 16680 

atggccaccc catcgatgat gccgcagtgg tcttacatgc acatcgccgg ccaggacgcc 16740 

tcggagtacc tgagtcccgg cctcgtgcag tttgcccgcg ccaccgacac ctacttcagc 16800 

ttgggaaaca agtttagaaa ccccaccgtg gcccccaccc acgatgtgac cacggaccgc 16860 

tcgcagaggc tgaccctgcg ctttgtgccc gtagaccggg aggacaccgc gtactcttac 16920 

aaagtgcgct acacgttggc cgtaggggac aaccgagtgc tggacatggc cagcacctac 16980 

tttgacatcc ggggggtgct ggatcggggt cccagcttca agccctattc cggcaccgct 17040 
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tacaactccc tggcccccaa gggagctccc aacccctcgg aatggacgga cacttccgac 17100 

aacaaactta aagcatatgc tcaggctccc taccagagtc aaggacttac aaaggatggt 17160 

attcaggttg ggctagttgt gacagagtca ggacaaacac cccaatatgc aaacaaagtg 17220 

taccaacccg agccacaaat tggggaaaac caatggaatt tagaacaaga agataaagcg 17280 

gcgggaagag tcctaaagaa agatacccct atgtttccct gctatgggtc atatgccagg 17340 

cccacaaacg aacaaggagg gcaggcaaaa aaccaagaag tagatttaca gttttttgcc 17400 

actccgggcg acacccagaa cacggctaaa gtggtacttt atgctgaaaa tgtcaacctg 17460 

gaaactccag atactcactt agtgtttaaa cccgatgacg acagcaccag ttcaaaactt 17520 

cttcttgggc agcaggctgc acctaacaga cccaactaca taggttttag agataatttt 17580 

attggtttaa tgtactaca.a tagcactgga aacatgggcg tgctggccgg acaggcttct 17640 

caattgaatg ccgtagtcga cttgcaggac agaaacaccg agttgtccta ccagctgatg 17700 

ctggacgcac tgggggatcg cagccgatat ttttcaatgt ggaatcaggc agtagacagc 17760 

tatgacccag acgttagaat tatagaaaac cacggagtgg aagacgaact gccaaactat 1782 0 

tgttttcctc tgggaggaat ggtggtgact gacaattaca actctgtgac gcctcaaaat 17880 

ggaggcagtg gaaatacatg gcaggcagac aatactacat ttagtcaaag aggagcgcag 17940 

attggctccg gaaacatgtt tgccctggaa attaacctac aggccaacct ctggcgcggc 18000 

ttcttgtatt ccaatattgg gttgtatctt ccagactctc tgaaaatcac ccccgacaac 18060 

atcacgctgc cagaaaacaa aaacacttat cagtacatga acggtcgcgt aacgccaccc 18120 

gggctcatag acacctatgt aaacgtgggc gcgcgctggt cccccgatgt catggacagc 18180 

attaacccct tcaaccacca ccgtaacgcg ggcttgcgct accgctccat gctcttgggc 18240 

aacggccgtt atgtgccttt tcacattcag gtgccccaaa aattctttgc cattaaaaac 18300 

ctgctgcttc tccccggttc ctatacctat gagtggaact tccgcaagga tgtcaacatg 18360 

atcctgcaga gctcgctggg taatgacctg cgagtggacg gggccagcat acgctttgac 18420 

agcattaacc tgtatgccaa cttttttccc atggcccaca acacggcctc taccctggaa 18480 

gccatgctgc gcaacgacac caatgaccag tccttcaacg actacctgtg cgcggctaac 18540 

atgctgtacc ccatccccgc caacgccacc agcgtgccca tttctattcc ttctcggaac 18600 

tgggctgcct tcaggggctg gagttttact cgcctcaaaa ccaaggagac tccctcgctg 18660 

ggctccggtt ttgaccccta ctttgtttac tccggctcca ttccctacct agatggcacc 18720 
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ttttacctca accacacttt caaaaaggtg tctattatgt ttgactcctc ggttagctgg 187B0 

cccggcaacg accgcctgct aacgcccaac gagttcgaaa ttaagcgttc cgtggacggt 18840 

gaagggtaca acgtggccca gagcaacatg accaaggact ggtttctaat tcaaatgctc 18900 

agtcactata atataggtta ccagggcttc tatgtgcccg agaactacaa ggaccgcatg 18960 

tactccttct tccgcaactt ccaaccaatg agccggcagg tggtagatac cgtgacttat 19020 

acagactaca aagatgtcaa gctcccctac caacacaaca actcagggtt cgtgggctac 19080 

atgggaccca ccatgcgaga gggacaggcc tacccggcca actatcccta ccccctgatc 19140 

ggagagactg ccgtacccag cctcacgcag aaaaagttcc tctgcgaccg ggtgatgtgg 19200 

aggataccct tctctagcaa ctttatgtcg atgggctccc tcaccgacct ggggcagaac 19260 

atgctgtacg ccaactccgc tcacgccttg gacatgactt ttgaggtgga tcccatggat 19320 

gagcccacgc ttctctatgt tctgtttgaa gtcttcgacg tggtgcgcat ccaccagccg 19380 

caccgcggcg tcatcgaggc cgtctacctg cgcacacctt tctctgccgg taacgccacc 19440 

acctaaagaa gctgatgggt tccagcgaac aggagttgca ggccattgtt cgcgacctgg 19500 

gctgcgggcc ctgctttttg ggcaccttcg acaagcgttt tcccggattc atgtcccccc 19560 

acaagccggc ctgcgccatc gttaacacgg ccggacggga gacagggggg gtgcactggc 19620 

tcgccttcgc ctggaacccg cgcaaccgca cctgctacct gttcgaccct tttggtttct 19680 

ccgacgaaag gctgaagcag atctaccaat tcgagtacga ggggctcctc aagcgcagcg 19740 

ctctggcctc cacgcccgac cactgcgtca ccctggaaaa gtccacccag acggtccagg 19800 

ggcccctctc ggccgcctgc gggcttttct gttgcatgtt tttgcacgcc ttcgtgcact 19860 

ggcctcacac ccccatggag cgcaacccca ccatggatct gctcaccgga gtgcccaaca 19920 

gcatgcttca cagtccccag gtcgccccca ccctgcgtcg caatcaggac cacctgtatc 19980 

gctttctggg gaaacactct gcctatttcc gccgccaccg gcagcgcatc gaacaggcca 20040 

cggccttcga aagcatgagc caaagagtgt aatcaataaa aaccgttttt atttgacatg 20100 

atacgcgctt ctggcgtttt tattaaaaat cgaagggttc gagggagggg tcctcgtgcc 20160 

cgctggggag ggacacgttg cggtactgga atcgggcgct ccaacgaaac tcggggatca 20220 

ccagccgcgg cagggccacg tcttccatgt tctgcttcca aaactgtcgc accagctgca 20280 

gggctcccat cacgtcgggc gctgagatct tgaagtcgca gttagggccg gagcccccgc 20340 

ggctgttgcg gaacacgggg ttggcacact ggaacaccaa cacgctgggg ttgtggatac 20400 

tagccagggc cgtcgggtcg gtcacctccg atgcatccag atcctcggca ttgctcaggg 20460 
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cgaacggggt 


cagcttgcac 


atctgccgcc cgatctgggg 


taccaggtcg 


cgcttgttga 


20520 


ggcagtcgca 


gcgcagaggg 


atgaggatgc gacgctgccc 


gcgttgcatg 


atggggtaac 


20580 


tcgccgccag 


gaactcctct 


atetgaegga aggecatctg 


ggccttgacg 


ccctcggtga 


20640 


aaaatagccc 


acaggacttg 


ctggaaaaca cgttattgcc 


acagttgatg 


tcttccgcgc 


20700 


agcagcgcgc 


atettegtte 


ttcagctgaa ccacgttgcg 


accccagcgg 


ttctgaacca 


20760 


ccttggcttt 


cgtgggatgc 


tccttcagcg cccgctgtcc 


gttcccgctg 


gtcacatcca 


20820 


tttccaccac 


gtgctccttg 


cagaccatct ccactccgtg 


gaaacagaac 


agaatgeect 


20880 


cctgttgggt 


attgegatge 


tcccacacgg cgcacccggt 


ggactcccag 


ctcttgtgtt 


20940 


tcacccccgc 


gtaggcttcc 


atgtaagcca ttagaaatct 


gcccatcagc 


tcagtgaagg 


21000 


tcttctggtt 


ggtgaaggtt 


ageggcagge cgcggtgttc 


ctcgttcaac 


caagtttgac 


21060 


agate ttgcg 


gtacaegget 


ccctggtcgg gcagaaactt 


aaaagtcgtt 


ctgctctcgt 


21120 


tgtccacgtg 


gaacttctcc 


atcaacatcg tcatgacttc 


catgcccttc 


tcccaggcag 


21180 


tcaccagcgg 


cgcgctctcg 


gggttcttca ccaacacggc 


ggtggagggg 


ccctcgccgg 


21240 


ccccgacgtc 


cttcatggac 


attttttgaa actccacggt 


gccgtccgcg 


cggcgtactc 


21300 


tgegcategg 


agggtagctg 


aagcccacct ecatgaeggt 


gctttcgccc 


tegctgtegg 


21360 


agacgatctc 


eggggaggge 


ggeggaaegg gggcagactt 


gcgagccttc 


ttcttgggag 


21420 


ggageggagg 


cacctcctgc 


tcgcgctcgg gactcatctc 


cegcaagtag 


ggggtgatgg 


21480 


agcttcctgg 


ttggttctga 


cggttggcca ttgtatccta 


ggcagaaaga 


catggagctt 


21540 


atgegegagg 


aaactttaac 


cgccccgtcc cccgtcagcg 


acgaagaggt 


categtcgaa 


21600 


caggacccgg 


getaegttae 


gccgcccgag gatctggagg 


ggeccttaga 


cgaccggcgc 


21660 


gaegctagtg 


ageggcagga 


aaatgagaaa gaggaggagg 


agggctgeta 


cctcctggaa 


21720 


ggcgacgttt 


tgetaaagea 


tttcgecagg cagagcacca 


tactcaagga 


ggccttgcaa 


21780 


gaccgctccg 


aggtgecett 


ggacgtcgcc gcgctctccc 


aggectaega 


ggegaacett 


21840 


ttctcgcccc 


gagtgcctcc 


gaagagacag cccaacggca 


cctgcgagcc 


caacccgcga 


21900 


ctcaacttct 


accccgtgtt 


cgccgtgccc gaggegctgg 


ccacctacca 


catctttttc 


21960 


aaaaaccagc 


gcattcccct 


ttcctgccgg gccaaccgca 


ccgcggccga 


taggaagcta 


22020 


acactcagaa 


aeggagtcag 


catacctgat atcacgtcac 


tggaggaagt 


gectaagate 


22080 


ttcgagggtc 


tgggtcgaga 


tgagaagegg gcggcgaacg 


etctgeagaa 


agaacagaaa 


22140 
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gagagtcaga 


acgtgctggt 


ggagctggag 


ggggacaacg 


cgcgtctgac 


cgtcctcaaa 


22200 


cgttgcatag 


aagtttccca 


cttcgcctac 


ccggccctca 


acctgccgcc 


caaagttatg 


22260 


aaatcggtca 


tggaccagct 


actcatcaag 


agagctgagc 


ccctgaatcc 


cgaccaccct 


22320 


gaggcggaaa 


actcagagga 


cggaaagccc 


gtcgtcagcg 


acgaggagct 


cgagcggtgg 


22380 


ctggaaacca 


gggaccccca 


gcagttgcaa 


gagaggcgca 


agatgatgat 


99cggccgtg 


22440 


ctggtcacgg 


tggagctaga 


atgcctgcaa 


cggtttttca 


gcgacgtgga 


gacgctacgc 


22500 


aaaatcgggg 


agtccctgca 


ctacaccttc 


cgccagggct 


acgttcgcca 


ggcctgcaaa 


22560 


atctccaacg 


tagagctcag 


caacctggtt 


tcctacatgg 


gcatcctcca 


cgagaaccgg 


22620 


ctggggcaga 


gcgtgctgca 


ctgcaccttg 


caaggcgagg 


cgcgaaggga 


ctacgtccga 


22680 


gactgcgtct 


acctcttcct 


caccctcacc 


tggcagaccg 


ccatgggcgt 


gtggcagcag 


22740 


tgcttggaag 


agagaaacct 


caaagagctg 


gacaaactcc 


tctgccgcca 


gcggcgggcc 


22800 


ctctggaccg 


gcttcagcga 


gcgcacggtc 


gcctgcgccc 


tggcagacat 


cattttccca 


22860 


gaacgcctga 


tgaaaacctt 


gcagaacggc 


ctgccggatt 


tcatcagtca 


gagcatcttg 


22920 


caaaacttcc 


gctcctfccgt 


cctggagcgc 


tccgggatct 


tgcccgccat 


gagctgcgcg 


22980 


ctgccttctg 


actttgtccc 


cctttcctac 


cgcgagtgcc 


ctcccccact 


gtggagccac 


23040 


tgctacctct 


tccaactggc 


caactttctg 


gcctaccact 


ccgacctcat 


ggaagacgtg 


23100 


agcggagagg 


ggctgctcga 


gtgccactgc 


cgctgcaacc 


tctgcacccc 


ccacagatcg 


23160 


ctggcctgca 


acaccgagct 


gctcagcgaa 


acccaggtca 


taggtacctt 


cgagatccag 


23220 


gggccccagc 


agcaagaggg 


tgcttccggc 


ttgaagctca 


ctccggcgct 


gtggacctcg 


23280 


gcttacttac 


gcaaatttgt 


agccgaggac 


taccacgccc 


acaaaattca 


gttttacgaa 


23340 


gaccaatctc 


gaccaccgaa 


agcccccctc 


acggcctgcg 


tcatcaccca 


gagcaaaatc 


23400 


ctggcccaat 


tgcaatccat 


caaccaagcg 


cgccgagatt 


tccttttgaa 


aaagggtcgg 


23460 


ggggtgtacc 


tggaccccca 


gaccggcgag 


gaactcaacc 


cgtccacact 


ttccgtcgaa 


23520 


gcagcccccc 


cgagacatgc cacccaaggg 


aaccgccaag 


cagctgatcg 


ctcggcagag 


23580 


agcgaagaag 


caagagctgc 


tccagcagca 


ggtggaggac 


gaggaagagc 


tgtgggacag 


23640 


ccaggcagag 


gaggtgtcag aggacgagga 


ggagatggaa 


agctgggaca 


gcctagacga 


23700 


ggaggacgag 


ctttcagagg aagaggcgac 


cgaagaaaaa 


ccacctgcat 


ccagcgcgcc 


23760 


ttctctgagc 


cgacagccga 


agccccggcc 


cccgacgccc 


ccggccggct 


cactcaaagc 


23820 


cagccgtagg 


tgggacgcca 


ccggatctcc 


agcggcagcg 


gcaacggcag 


cgggtaaggc 


23880 
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eggegggggt 


attgetcctg 


gcggacccac 


aaaagcagta 


tcgtgaactg 


23940 




tgcgggggaa 


acatctcctt 


tgcccgacgc 


tacctcctct 


tccatcacgg 


24000 


tgcggccctc 


cctcgcaacg 


ttctctatta 


ttacegtcat 


ctctacagcc 


cctacgaaac 


24060 


get eg gaga a 


aaaag'ctaag 


gcctcctctg 


ccgcgaggaa 


aaactccgcc 


gccgctgccg 


24120 


ccaaggatcc 


gccggccacc 


gaggagctga 


gaaagegcat 


ctttcccact 


ctgtatgcta 


24180 


tctttcagca 


aagccgcggg 


cagcaccctc 


agegegaact 


gaaaataaaa 


aaccgctcct 


24240 


tccgctcact 


cacccgcagc 


tgtctgtacc 


acaagagaga 


agaccagctg 


cagcgcaccc 


24300 


tggacgaege 


cgaagcactg 


ttcagcaaat 


actgctcagc 


gtctcttaaa 


gactaaaaga 


24360 


cccgcgcttt 


ttccccctcg 


ggcgccaaaa 


cccacgtcat 


cgccagcatg 


agcaaggaga 


24420 


ttcccacccc 


ttacatgtgg 


agctatcagc 


cccagatggg 


cctggccgcg 


ggggccgccc 


24480 


aggactactc 


cagcaaaatg 


aactggctca 


gcgccggccc 


ccacatgatc 


tcacgagtta 


24540 


acggcatccg 


agcccaccga 


aaccagatcc 


tcttagaaca 


ggeggcaate 


accgccacac 


24600 


cccggcgcca 


actcaacccg 


cccagttggc 


ccgccgccca 


ggtgtatcag 


gaaactcccc 


24660 


gcccgaccac 


agtcctcctg 


ccacgcgacg 


eggaggcega 


agtcctcatg 


actaactctg 


24720 


gggtacaatt 


agegggeggg 


tccaggtacg 


ccaggtacag 


aggtegggee 


gctccttact 


24780 


ctcccgggag 


tataaagagg 


gtgatcattc 


gaggecgagg 


tatccagctc 


aacgacgagg 


24840 


eggtgagetc 


ctcaaccggt 


ctcagacctg 


aeggagtect 


ccagctcgga 


ggagegggee 


24900 


gctcttcctt 


caccactcgc 


caggcctacc 


tgaccctgca 


gage tct tec 


tcgcagccgc 


24960 


getceggggg 


aateggcact 


ctccagttcg 


tggaagagtt 


cgtcccctcc 


gtctacttca 


25020 


acccgttttc 


cggctcacct 


ggacgctacc 


cggacgcctt 


cattcccaac 


tttgacgcag 


25080 


tgagtgaatc 


cgtggacggc 


tacgactgat 


gacagatggt 


gcggccgtga 


gagctegget 


25140 


gcgacatctg 


catcactgcc 


gccagcctcg 


ctgctacgct 


egggaggega 


tcgtgttcag 


25200 


ctactttgag 


ctgccggacg 


agcaccctca 


gggacegget 


cacgggttga 


aactcgagat 


25260 


tgagaacgcg 


cttgagtctc 


acctcatcga 


cgccttcacc 


gcccggcctc 


tcctggtaga 


25320 


aaccgaacgc 


gggatcacta 


ccatcaccct 


gttctgeate 


tgccccacgc 


ceggattaca 


25380 


tgaagatctg 


tgttgtcatc 


tttgegctea 


gtttaataaa 


aactgaactt 


tttgeegtae 


25440 


cttcaacgcc 


acgcgttgtt 


tctccttgtg 


aaaaaacccc 


aggagtcctt 


aacttacaca 


25500 


tagcaaaacc 


cttgtatttt 


accatagaaa 


aacaactagc 


cctttcaatt 


ggaaaagggt 


25560 



too 



taacaatttc 


tgctacagga cagttggaaa 


gcacagcaag 


cgtacaaaac 


oy wy w taLdC 


*5 c CO A 


cacccctacg 


tggtatttcc cctttaaagc 


tgacagacaa 


cggtttaaca 




« jDOv 


cagatcccct 


gcgtgtggta ggtgaccaac 


ttacgtttaa 


ttttacttct 


V w O w L. O wy L. L. 




acgaaaatgg 


cagtcttaca ttcaactaca 


cttctcccat 


gacactaata 




«3t)UU 


ttgctattaa 


cgtcaatacc 


tccaaaggcc 


tcagtagtga 


caacoQcaca 


v-u wy w tyLda 


Z3 OOU 


atgttactcc 


agattttaga tttaacagct 


ctggtgcctt 


aactt ttaac 






tatggacttt 


tccaaccaaa 


actcctaact 


gtaccgtgtt 


taccaaaaat" 


y at tccc ty C 




tgagtctttg 


cttgactaaa 


tgcggagctc 


acgtacttgg 


aaacqtaaat 




ZDU4U 


tggcaggaac 


catgctaaaa 


atgacccaca 


cttctgctac 


^»y u irv.ay uul 


tcgctxgatg 


26100 


acagtggtaa 


actaatattc 


tctccacttg 


cgaacaacac 




cgacaaagcg 


26160 


agagtccgtt 


gcccaaccca 


tccttcaacg 


ctctcacgtt 


tatcrccaaacr 


anh ar*r»a t- t- •> 
ayLotCCaLtk 


2 bZ2 0 


attctagagg 


agcaagtaac 


gaacctcaaa 


acaattatta 


tQtccaaara 




26280 


gcaacgtgcg 


aaagccaatt 


ctaccaactg 


ttacctacaa 


ctcaattaat 


Ltd ygacact 


2634 0 


ccttaacttt 


taaatgggat 


gctgtcgcca 


atgaaaaatt 


tQccactcct 


a^dLtLLLyu 


OCA ft A 


tttgctatgt 


tgcagagcaa 


taaaaccctg 


ttaccccacc 


Qtctcat ttt 

3: w v. wy w L W V 


Lttcagacya 


26460 


aacgagcgag 


agttgatgaa gacttcaacc 


cagtgtaccc 


ttatgacccc 


ccata car* t~ r» 




ccgtcatgcc 


cttcattact 


ccgcctttta 


cctcctcgga 


tQorqttorcaa 




ZODO U 


ttggagtgtt 


aagtttaaac 


tacagggatc 


ccatcactac 


acaaaaCggg 


tcf rtrarnt- 


200<* U 


taaaactagg 


aaacggcctc 


actctaaaca 


accagggaca 


QttaacatcA 


a w i.y w t.yycy 


2b /UU 


aagtggagcc 


tccgctcact aatgctaaca 


acaaacttgc 




agcgaaccac 


26760 


tagcagtaaa 


aagcaaccgc 


ctaactctat 


cacacaccgc 


Vwwwwwl»yi.w 


atcgctaata 


26820 


attctttagc 


gttgcaagtt 


tcagagccta 


tt t ttgtaaa 




aagc tiagccc 


26880 


tgcagacagc 


cgcccccctt 


gtaaccaacg 


ctggcaccct 


tccicttarao 


sgcgccgccc 


26940 


ctttaggatt 


ggttgaaaat 


actcttaaac 


tgctgttttc 


taaaccctta 


Ldtttycaaa 


27000 


atgattttct 


tgcattagcc 


attgaacgcc 


ccccggccgc 


agcagccgca 


ggtactctga 


27060 


ccctacaact 


tactcctcca 


ttaaagacta 


acgatgacgg 


gctaacacta 


tccacagtcg 


27120 


agccattaac 


tgtaaaaaac ggaaacctag 


gcttgcaaat 


atcgcgccct 


ttagttgttc 


27180 


aaaacaacgg 


cctttcgctt gctattaccc 


ccccgctgcg 


tttgtttaac 


agcgaccccg 


27240 


ttcttggttt 


gggcttcact tttcccctag 


ctgtcacaaa 


caacctcctc 


tccttaaaca 


27300 
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tgggagacgg agttaaactt acctataata aactaacagc caatttgggc agggatttac 27360 

aatttgaaaa cggtgcgatt gccgtaacgc ttactgccga attacctttg caatacacta 27420 

acaaacttca actgaatatt ggagctggcc ttcgttacaa tggagccagc agaaaactag 27480 

atgtaaacat taaccaaaat aaaggcttaa cttgggacaa cgatgcagtt attcccaaac 27540 

taggatcggg cttacaattt gaccctaatg gcaacatcgc tgttatccct gaaaccgtga 27600 

agccgcaaac gttatggacg actgcagatc cctcgcctaa ctgctcagtg taccaggact 27660 

tggatgccag gctgtggctc gctcttgtta aaagtggcga catggtgcat ggaagcattg 27720 

ccctaaaagc cctaaaaggg acgttgctaa atcctacagc cagctacatt tccattgtga 27780 

tatattttta cagcaacgga gtcaggcgta ccaactatcc aacgtttgac aacgaaggca 27840 

ccttagctaa cagcgccact tggggatacc gacaggggca atctgctaac actaatgtga 27900 

ccaatgccac tgaatttatg cccagctcaa gcaggtaccc cgtgaataaa ggagacaaca 27960 

ttcaaaatca atctttttca tacacctgta ttaaaggaga ttttgctatg cctgtcccgt 28020 

tccgtgtaac atataatcac gccctggaag ggtattccct taagttcacc tggcgcgttg 28080 

tagccaatca ggcctttgat attccttgct gttcattttc atacatcaca gaataaaaaa 28140 

ccactttttc attttaattt ctttttattt tacacgaaca gtgagacttc ctccaccctt 28200 

ccatttgaca gcatacacca gcctctcccc cctcatagca gtaaactgtt gtgaatcagt 28260 

ccggtatttg ggagttaaaa tccaaacagt ctctttggtg atgaaacgtc gatcagtaat 28320 

ggacacaaat ccctgggaca ggttttccaa cgtttcggtg aaaaactgca caccgcccta 28380 

caaaacaaac aggttcaggc tctccacggg ttatctcccc gatcaaactc agacagggta 28440 

aaggcgcggt ggtgttccac taaaccacgc aggtggcgct gtctgaacct ctcggtgcga 28500 

ctcctgtgag gctggtaaga agttagattg tccagtagcc tcacagcatg tatcatcagt 28560 

ctacgagtgc gtctggcgca gcagcgcatc tgaatctcac tgagattccg gcaagaatcg 28620 

cacaccatca caatcaggtt gttcatgatc ccatagctga acacgctcca gccaaagctc . 28680 

attcgctcca acagcgccac cgcgtgtccg tccaacctta ctttaacata aatcaggtgt 28740 

ctgccgcgta caaacatgct acccacatac agaacttccc ggggcaggcc cctgttcacc 28800 

acctgtctgt accagggaaa cctcacattt atcagggagc catagatggc cattttaaac 28860 

caattagcta ataccgcccc accagctcta cactgaagag aaccgggaga gttacaatga 28920 

cagtgaataa tccatctctc ataacccctg atggtctgat gaaaatctag atctaacgtg 28980 
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gcacaacaaa tacacacttt catatacafct ttcacaacat ^ 

atacaatccc aa t acac 399 ccactcctgc a.tacaataa a 9 c t aa t aca a g at 9gtata 29100 

ctcctcacct cac t9ac ac t gt9catgttc atattttcac attctaagta ^ 

tcctctacag ca 9 cac t9 ct 9 c 99t cctca caaggtggta gc tgg tgatg attgtagggg 29220 

9cc a9 tct 9 c a 9 c 9at acc 9 tct 9 tc 9 c 9 t t9 catc gt a 9 accaggaacc gacgcacctc 29280 

c t c 9t act tg t ggtagcaga accacgtccg ctgccagcac gtctccacgt aacgccggtc 2934Q 

cct 9 c gt c g c tc a c gct ccc t cctc aatgc aaagtgcaac cactcttgta atccacacag 29400 

atccctctcg 9 cctcc gggg tgatgcacac ctcaaaccta cagatgtctc ggtacagtfcc 2946o 

caaacac 3ta gtgagggcga gttccaa cc a aga ca 3 aca 3 cc tgatctat cccgac.cac 29520 

t9 9 a 9gt9ga ggaagacacg gaagaggcat 9ttattccaa gcgattcacc aacgggtcga 29580 

aa t9aagatc CCgaa9atga caacggtcgc c tC c 99agcc c tgatggaat ttaacagcca 29640 

g a tcaaacgt tatgcgattc C cc aagctat cga cc g cc gc ttcc aaaaga gC c tggac cc 29700 

g cac tt cc a c a aa cacc agc a aagcaaaag cacCattatc aaactcCtca atcatcaagc ^ 

t 9ca 99act9 t acaatg cc t aagtaatttt cgtctctcca ctcgcgaafcg afcgtcgcggc 2982Q 

a g a tagtcCg aaggttcatc ccgtgcaggg . taaaaagctc cgaaagggcg ccctcfcacag . 29880 

cca tgC9tag acacaccatc atgactg c aa 3atatcgggc tcctgagaca cctgcagcag 29940 

att taacaga tcaaggtcag gttgc tc t cc g c 9atca c ga atct cc at cc 9 c a a 9gtcat 30000 

ttgCaaaa aa t C aa ataaat ctatgccgac tagatctgtc aactccgcat taggaaccaa ^ 
atca ggtgtg gctacg c ag c acaaaagtt c c agggaCggt gccaaactca ctagaaccgc 

tccc gagtaa c aaaactgat gaatgggagt aaca ca gtgt aaaatgtgca accaaaaatc 30180 

ac taaggtgc tcctttaaaa agCC c agtac ttctatattc a gt cc gtgca agtactgaag 30240 

caac tgtgcg Sgaatatg c a ca acaaaaaa aatagggcgg ctcagataca Cgttgaccta 303OO 

aaataaaaag aatcattaaa. c taaagaagc ttggcgaacg gtgggataaa tgacacgctc 30360 

cagcagcaga- caggcaaccg gctgtccccg 9gaa ccgcgg taaaat tc at ccgaatgatt 30420 

a aaaa3aaca acagaaactt cccaccatgt actcggttgg a tc t cct gag cacacagcaa 30480 

taccccccto aca ttcat g t ccgc Ca c aga aaaaaaacgt ccc agataC c ca 9 c 9gggat 30540 

atcc aacga c agctgcaaag ac agcaaaac a.tccctctg ggagcgatca c aaaa tcct:c 30600 

c ggtgaaaaa agca cataca tattagaata accc tgtt g C t gg ggc aaaa .ggcccggcg 30660 

tcccagcaaa tgcacataaa tatgttcatc agccattgcc ccgtcttacc gcgtaatcag 30720 
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ccacgaaaaa atcgagctaa aattcaccca aca 3 cctata gctatatata cactcegccc 307B0 

aatgacgcta ataccgcacc acccacgacc aaag tCC acc cacacccaca aaacccgcga 30840 

aaatccagcg ccgccagcac ttccgcaatt tcagtctcac aacg tC ac tt ccgcgcgcct 30900 

tttcacattc ccacacacac ccgcgccctt cgccccgccc tcg.gccacc ccgcg t cacc 309SO 

gcacgtcacc ccggccccgc ctcgctcctc cccgctcatt atcatattgg cacgtttcca 31020 
gaataaggta tattattgat gatg 



<210> 6 

<211> 34115 

<212> DNA 

<213> simian adenovirus SV-39 

<400> 6 



catcatcaat ataacaccgc aagatggcga ccgagttaac atgcaaatga ggtgggcgga 
gttacgcgac ctttgtcttg ggaacgcgga agtgggcgcg gcgg gtttC g gggaggagcg 
cggggcgggg cgggcgtgtc gcgcggcggt gacgcgccgg ggacccggaa attgagtagt 
ttttattcat tttgcaagtt tttctgtaca ttttggcgcg aaaactgaaa cgaggaagtg 
aaaagtgaaa aatgccgagg tagtcaccgg gtggagatcc gacctttgcc gtgtggagtt 
tacccgctga cgtgtgggtt tcggtctcta ttccttcact gtggttttcc gggtacggtc 
aaaggtcccc attttatgac tccacgtcag ctgatcgcta gggtatttaa tgcgcctcag 
accgtcaaga ggccactctt gagtgccggc gagaagagtt ttctcctccg cgttccgcca 
actgtgaaaa aatgaggaac ttcttgctac ctccggggct gccagcgacc gtagccgccg 
agctgttgga ggacattgtt accggagctc tgggagacga tcctcaggtg atttctcact 
tttgtgaaga ttttagtctt catgatctct atgatattga tccgggtgtt gaggggcaag 
aggatgaatg gctggagtct gtggatgggt tttctccgga cgctatgctg ctagaggctg 
atttgccacc acctcacaac tctcacactg agcccgagtc agctgctatt cctgaattgt 
catcaggtga acttgacttg gcttgctacg agactatgcc tc.ggagtcg gatgaggagg 
acagcgggat cagcgatccc acggc tttCa tggtctctaa ggcgattgct atactaaaag 
aagatgatga tggcgatgat ggatttcgac tggacgctcc ggcggtgccg gggagagact 
gtaagtcctg tgaataccac cgggatcgta ccggagaccc gtctatgctg tgttctctgt 
gttatctccg tcttaacgct gcttttgtct acagtaagtg ttttgtgctt ttttaccctg 
tggctttgtt gagtttattt ttttctgtgt ctcatagggt g ttgttt a tt ataggtcctg 



31044 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
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tttcagatgt ggaggaacct 
cgaaactaac tcagcgctgc 
cccggaaacg tgctgctgtt 
tggacttgtc cttaaagcga 
agcttccggg tgactcagtg 
tacgggcggg aagggattag 
atcgagtgag aagttccatg 
gtttgctgga ggaggcctcc 
ctctgagtcg ctttttgcac 
tagagcagct gcctggactg 
agaggctttt tccacaattg 
ttgctgtaca tctgttggac 
tggacttcct gacgctatgc 
ggcgcttggt ggagaggcat 
tgaggcggga ggatctggaa 
cgagagcggg gctggaccct 
gttggctttg ggggcggtgg 
acgcggtcag aagagggagc 
cctgatggcg aaaaacaggc 
gaagggggac ttgcacctgc 
gaacccggat gaggactggg 
cgactgtcgc taccaggttc 
caacggcgcc accgtggaga 
agaaatgtgc cctggggtgg 
cggtagcaat tttgggggtg 
ctactttttt ggcttcagca 
gtgttccttt tacgcttgct 
tcacaagtgt atgttggaga 



gatagtacta ctggaaatga ggaggaaaag 
agacctaata ttttgagacc ctcggcccag 
aattgcatag aagatttatt ggaagagccc 
ccccgcccgc agtagggcgc ggtgccagtt 
caataaaaat tttcttggca acaggtgtat 
gggagtataa agctggaggg gaaaaatctg 
gacttgtacg agagcctaga gaatctaagt 
gacagaacct cttacatttg gaggtttctg 
cgggtgaagc gagagcacct gacggaattt 
tttgattctt tgaatctcgg ccaccggacg 
gacttttcct ctccaggccg tctgtgttca 
agatggaacg agcagacgca gctcagcccg 
ctatggaagt tcggaatcag gagggggagg 
ccgtctctgc gccagcagcg tctgcaagct 
gccatttcgg aggaggagag cggcatggaa 
ccggcggagg agtagggggg ataccggacc 
ggggcgcttc tgtggtacgt gaggatgaag 
attttgagtc ctcgactttc ttggctgatg 
tggaggtggt gtggtacccg gaagtatggg 
tggaaaaata taactttgag caggtgaaaa 
aggtggtttt gaaccgatac ggcaaggtag 
gcgacaaggt ggtcctgcga cgcaacgtgt 
tggtggaccc cagaaggggt ggttttgtgg 
tgggcttgtc tggggtgact tttcatagtg 
tggttattac cgcgaacact cctgtggtcc 
acacctgtgt ggaaatgagg gtgggaggca 
ggaagggggt ggtgagccag ggtaaggcta 
gatgcacctt gggcatttcc agtgagggct 



ccctccccgc 


1200 


cgtgtgtcat 


1260 


actgaacctt 


1320 


ttttctctct 


1380 


gtgtttactt 


1440 


aggctgtcag 


1500 


tctttgcgac 


1560 


ttcggttccc 


1620 


gatgggcttt 


1680 


ctgctagagg 


1740 


gcgcttgctt 


1800 


ggttacactc 


1860 


aagctgtacg 


1920 


caagtgctgc 


1980 


gagaagaatc 


2040 


cttttcctga 


2100 


aggggcgcca 


2160 


taaccgtggc 


2220 


aggactttga 


2280 


catactggat 


2340 


ctctgcgtcc 


2400 


acctgttggg 


2460 


ccaatatgca 


2520 


tgaggtttag 


2580 


tgcataattg 


2640 


aagtgcgcgg 


2700 


aagtgtctgt 


2760 


tcctccacgc 


2820 
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cagcgacaac gtggcttctg acaacggctg cgcctttctt atcaagggag ggggtcgcat 2880 

ctgtcacaac atgatatgcg gccctgggga tgtcccccca aagccttacc agatggttac 2940 

ctgcacagat ggcaaggtgc gbatgctcaa gcctgtgcac attgtgggcc accggcgcca 3000 

ccgctggcca gagtttgaac acaatgtgat gacccgctgt agcttgtacc tgggaggcag 3060 

gcgaggagtt ttcttgccca gacagtgtaa cctggcccac tgcaacgtga tcatggaaca 3120 

atccgccgct acccaggttt gctttggagg aatatttgat ataagcatgg tggtgtataa 3180 

gatcctgcgc tacgacgact gtcgggctcg tactcgaacc tgcgactgcg gagcctctca 3240 

cctgtgtaac ctgactgtga tggggatggt gactgaggag gtgcgactgg accactgtca 3300 

gcactctcgc ctgcgggagg agttttcttc ctcggacgag gaggactagg taggtggttg 3360 

gggcgtggcc agcgagaggg tgggctataa aggggaggtg tcggctgacg ctgtcttctg 3420 

tttttcaggt accatgagcg gatcaagcag ccagaccgcg ctgagcttcg acggggccgt • 3480 

gtacagcccc tttctgacgg ggcgcttgcc tgcctgggcc ggagtgcgtc agaatgttac 3540 

cggttcgacc gtggacggac gtcccgtgga tccatctaac gctgcttcta tgcgctacgc 3600 

tactatcagc acatctactc tggacagcgc cgctgccgcc gcagccgcca cctcagccgc 3 660 

tctctccgcc gccaagatca tggctattaa cccaagcctt tacagccctg tatccgtgga 3720 

cacctcagcc ctggagcttt accggcgaga tctagctcaa gtggtggacc aactcgcagc 3780 

cgtgagccaa cagttgcagc tggtgtcgac ccgagtggag caactttccc gccctcccca 3840 

gtaaccgcaa aaattcaata aacagaattt aataaacagc acttgagaaa agtttaaact 3900 

tgtggttgac tttattcctg gatagctggg gggagggaac ggcgggaacg gtaagacctg 3960 

gtccatcgtt cccggtcgtt gagaacacgg tggatttttt ccaagacccg atagaggtgg 4020 

gtctgaacgt tgagatacat gggcatgagc ccgtctcggg ggtggaggta ggcccactgc 4080 

agggcctcgt tttcaggggt ggtgttgtaa atgatccagt cgtaggcccc ccgctgggcg 4140 

tggtgctgga agatgtcctt cagcagcaag ctgatggcaa cgggaagacc cttggtgtag 4200 

gtgttgaeaa agcggttgag ttgggagggg tgcatgcggg gactgatgag gtgcattttg 4260 

gcctggatct tgaggttggc tatgttgccg cccagatcgc gcctgggatt catgttatgc 4320 

aagaccacca gcaccgagta accggtgcag cgggggaatt tgtcgtgcag cttggaaggg 4380 

aaagcgtgga agaatttgga gacccctcgg tgcccgccta ggttttccat gcactcatcc 4440 

atgatgatgg cgatgggccc ccgggaggca gcctgggcaa aaacgttgcg ggggtccgtg 4500 

acatcgtagt tgtggtcctg ggtgagttca tcataggaca ttttgacaaa gcgcgggcag 4560 
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agggtcccag actggggaat gatggttcca tccggtccgg gggcgtagtt gccctcgcag 4 620 

atttgcattt cccaggcttt gatttcagag ggagggatca tgtcaacctg gggggcgatg 4680 

aaaaaaatgg tctctggggc gggggtgatg agctgggtgg aaagcaggtt gcgcaagagc 4740 

tgtgacttgc cgcagccggt gggcccgtag atgacagcta tgacgggttg cagggtgtag 4800 

tttagagagc tacaactgcc atcatccttc aaaagcgggg ccacactgtt taaaagttct 4860 

ctaacatgta agttttcccg cactaagtcc tgcaggagac gtgaccctcc tagggagaga 4920 

agttcaggaa gcgaagcaaa gtttttaagt ggcttgaggc catcggccaa gggcaagttc 4980 

ctgagagttt gactgagcag ttccagccgg tcccagagct cggttacgtg ctctacggca 5040 

tctcgatcca gcagacctcc tcgtttcggg ggttggggcg gctctggctg tagggaatga 5100 

ggcggtgggc gtccagctgg gccatggtgc ggtccctcca tgggcgcagg gttctcttca 5160 

gggtggtctc ggtcacggtg aatgggtggg ccccgggctg ggcgctggcc agggtgcgct 5220 

tgaggctgag gcggctggtg gcgaaccgtt gcttttcgtc tccctgcaag tcagccaaat 5280 

agcaacggac catgagctca tagtccaggc tctctgcggc atgtcctttg gcgcgaagct 5340 

tgcctttgga aacgtgcccg cagtttgagc agagcaagca ttttagcgcg tagagttttg 5400 

gcgccaagaa cacggattcc ggggaataag catccccacc gcagttggag caaacggttt 54 60 

cgcattccac cagccaggtc agctgaggat cttttgggtc aaaaaccaag cgcccgccgt 5520 

tttttttgat gcgcttccta cctcgggtct ccatgaggcg gtgcccgcgt tcggtgacga 5580 

agaggctgtc ggtgtctccg tagacggagg tcagggcgcg ctcctccagg ggggtcccgc 564 0 

ggtcctcggc gtagagaaac tcgcaccact ctgacataaa cgcccgggtc caggctagga 5700 

cgaatgaggc gatgtgggaa gggtaccggt cgttatcgat gagggggtcg gttttttcca 5760 

aggtgtgcag gcacatgtcc ccctcgtccg cttccaaaaa tgtgattggc ttgtaggtgt 5820 

aagtcacgtg atcctgtcct tccgcggggg tataaaaggg ggcgtttccc ccctcctcgt 5880 

cactctcttc cggttcgctg tcgccaaagg ccagctgttg gggtacgtaa acgcgggtga 5940 

aggcgggcat gacctgtgcg itgaggttgt cagtttctat atacgaggaa gatttgatgg 6000 

cgagcgcccc cgtggagatg cccttgaggt gctcggggcc catttggtca gaaaacacaa 6060 

tctgtcggtt atcaagcttg gtggcaaaag acccgtagag ggcgttggag agcaacttgg 6120 

cgatggagcg ctgggtttgg tttttttccc ggtcggcttt ttccttggcc gcgatgttga 6180 

gctggacgta ctccctggcc acgcacttcc agccgggaaa aacggccgtg cgctcgtccg 6240 
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gcaccagcct 


cacgctccat 


ccgcggttgt 


qcacjqataat 

3 wo 333 L 3 QL 


yacgccgacg 


ctggtggcca 


6300 


cctctccgcg 


caggggctcg 


ttggtccagc 


aqaqqcqacc 

~3*"*33 3 v " 


y^LLLuycya 


gagcagaagg 


6360 


ggggcagggg 


gtcaagcagg 


cgctcgtccg 


gqqqqtCQOC 


yLLyaiygca 


aagatggcgg 


6420 


gcagcaggtg 


tttgtcaaag 


taaccgatct 


qatqcccodo 


y ccicicy cagg 


gcggtttccc 


6480 


agtcccgcac 


cgccaaggcg 


cgctcgtatg 


qac tqaqqqq 

3 "3"3333 


yy ^-y 


ggcatgggat 


6540 


gcgtcagggc 


cgaggcgtac 


atgccgcaga 


tgtcatagac 


Ot" AAA flflfifrr' 

y ucidaygggc 


tcctccagga 


6600 


cgccgaggta 


99tggggtag 


cagcgccccc 


w 3 u 33 l *''3^'* 


ggcccgtacg 


tagtegtaga 


6660 


gctcgtgcga 


99gggccaga 


33 4, 33 s '33 u 


L y a yy *-9 age 


gcgctggggc 


ttttcatctc 


6720 


ggaagaggat 


ctgcctgaag 




ant - f ona rtna 

a 3 & t-gg^gga 


gatggtgggc 


cgctgaaaaa 


6780 


tgttgaagcg 


ggcgtcgggc 


agacccacgg 




aaagtgggcg 


taggactctt 


6840 


gcagcttttc 


caccagggag 


qcqqtqacca 




agegcagtag 


tccagggttt 


6900 


cccgcacgat 


gtcataatgc 


tcttcctttt 

W w www W w w w 


tttCCttCCo 


gaggtctegg 


ttgaagagat 


6960 


actcttcgcg 


gtctttccag 


tactcttgga 




gcccccgtct 


ecaeggtaag 


7020 


agcccaacat 


gtaaaactgg 


t tqacQacct 


OA hannn a/-«.a 


gcatcccttc 


tccacgggca 


7080 


gcgagtaggc 


cagggcggcc 


ttqcqcaaaa 


^yyty L yayt 


cagggcaaag 


gtgtcgcgga 


7140 


ccataacttt 


tacaaactgg 


tacttaaagt 


^v^yy t-Cy tc 


gcacatgcct 


cgctcccagt 


7200 


ctgagtagtc 


tgtgcgcttt 


ttatacttaa 


33 1. taggcag 


ggagtaggtg 


acgtcgttaa 


7260 


agaggatttt 


gccacatctg 


ggcataaagt 


**3^*y a 3 a y» 


cccy aagggg 


ccgggcacct 


7320 


ccgagcggtt 


gttgatgact 


tQQQcaacca 


aaaaa a y t* f- r» 
33 a yaotn»t 


gtcgaagccg 


ttgatgttgt 


7380 


gccccacgac 


gtagaactct 


atgaaacgcg 


yay^yLuyLg 


cagcaggggg 


cacttttcaa 


7440 


gttgctggaa 


agtaagttcc 


cQCQqctcaa 


*»y uy llc 


cgtgcggctc 


cagtcctcca 


7500 


ccgggtttcg 


ctccacaaaa 


tcctgccaga 


^ nh Ort H ^"r"»a 

uy cyy u cycle 


tagcaagagc 


tgcagtcggt 


7560 


cgcgaaattc 


gcggaatttt 


ctQCcaataa 


phf or* hh/'ihn 
uWLyCLCCCy 


ggggttcaag 


caaaaaaagg 


7620 


tgtctgcgtg 


gtcgcgccag 


QCQtcccaac 


^»yay cu cgcg 


agecagatte 


agggecagea 


7680 


gcaccagagc 




ytgaccccca 


tgacgaggag 


aaagggcacc 


agctgttttc 


7740 


cgaacgcgcc 


catccaggtg 


taggtctcca 


cgtcgtaggt 


gagaaacaga 


cgttcggtcc 


7800 


gcgggtgcga 


tcccaggggg 


aaaaacttga 


tgggctgeca 


ccattgggag 


ctctgggcgt 


7860 


ggatgtgatg 


gaagtaaaag 


tcccggcggc 


gcgtggaaca 


ttcgtgctgg 


tttttgtaaa 


7920 


agcggccgca 


gtggtcgcag 


cgcgagacgg 


agtgaaggct 


gtgaatcagg 


tgaatcttgc 


7980 
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gccgccgagg 


gggccccaga 


gccaaaaagc 


ggagcgggaa 


cgaccgcgcg 


gccacttcgg 


8040 


cgcccgcagg 


caagatggat 


gagggttcca 


ccgttccccg 


cccgcggacc 


gaccagactt 


8100 


ccgccagctg 


cggcttcagt 


tcttgcacca 


gctctcgcag 


cgtttcgtcg 


ctgggcgaat 


8160 


cgtgaatacg. 


gaagttgtcg 


ggtagaggcg 


ggaggcggtg 


gacttccagg 


aggtgtgtga 


,8220 


gggccggcag 


gagatgcagg 


tggtacttga 


tttcccacgg 


atgacggtcg 


cgggcgtcca 


8280 


aggcgaagag 


atgaccgtgg 


ggccgcggcg 


ccaccagcgt 


tccgcggggg 


gtctttatcg 


8340 


gcggcgggga 


cgggctcccg 


gcggcagcgg 


cggctcggga 


cccgcgggca 


agtcgggcag 


8400 


cggcacgtcg 


gcgtggagct 


cgggcagggg 


ctggtgctgc 


gcgcggagct 


gactggcaaa 


8460 


ggctatcacc 


cggcgattga 


cgtcctggat 


ccggcggcgc 


tgcgtgaaga 


ccaccggacc 


8520 


cgtggtcttg 


aacctgaaag 


agagttcgac 


agaatcaatc 


tcggcatcgt 


taaccgcggc 


8580 


ctggcgcagg 


atttcggcca 


cgtccccgga 


gttgtcttga 


tacgcgattt 


ctgccatgaa 


8640 


ctggtcgatt 


tcctcttcct 


gcaagtctcc 


gtgaccggcg 


cgttcgacgg 


tggccgcgag 


8700 


atcgttggag 


atgcggccca 


tgagctggga 


aaaggcattg 


atgccgacct 


cgttccacac 


8760 


tcggctgtac 


accacctctc 


cgtgaacgtc 


gcgggcgcgc 


atcaccacct 


gggcgagatt 


8820 


gagttccacg 


tggcgggcga 


aaaccggata 


gtttcggagg 


cgctgataca 


gatagttgag 


8880 


ggtggtggcg 


gcgtgctcgg 


ccacaaaaaa 


atacatgatc 


cagcggcgga 


gggtcagctc 


8940 


gttgatgtcg 


cccagcgcct 


ccaggcgttc 


catggcctcg 


taaaagtcca 


cggcaaagtt 


9000 


gaaaaattgg 


ctgttcctgg 


ccgagaccgt 


gagctcttct 


tccaagagcc 


gaatgagatc 


9060 


cgccacggtg 


gccctgactt 


cgcgttcgaa 


agccccgggt 


gcctcctcca 


cctcttcctc 


9120 


ctcgacttct 


tcgaccgctt 


cgggcacctc 


ctcttcctcg 


accaccacct 


caggcggggc 


9180 


tcggcggcgc 


cggcggcgga 


cgggcaggcg 


gtcgacgaaa 


cgctcgatca 


tttcccccct 


9240 


ccgtcgacgc 


atggtctcgg 


tgacggcgcg 


accctgttcg 


cgaggacgca 


gggtgaaggc 


9300 


gccgccgccg 


agcggaggta 


acagggagat 


cggggggcgg 


tcgtggggga 


gactgacggc 


9360 


gctaactatg 


catctgatca 


atgtttgcgt 


agtgacctcg 


ggtcggagcg 


agctcagcgc 


9420 


ttgaaaatcc 


acgggatcgg 


aaaaccgttc 


caggaacgcg 


tctagccaat 


cacagtcgca 


9480 


aggtaagctg 


aggaccgtct 


cgggggcttg 


tctgttctgt 


cttcccgcgg 


tggtgctgct 


9540 


gatgaggtag 


ttgaagtagg 


cgctcttgag 


gcggcggatg 


gtggacagga 


gaaccacgtc 


9600 


tttgcgccca 


gcttgctgta 


tccgcaggcg 


gtcggccatg 


ccccacactt 


ctccttgaca 


9660 
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gcggcggagg tccttgtagt 


attcttgcat 


cagcctttcc 


^yyycacc c 


cgtcttcttc 


9720 


ttccgctcgg ccggacgaga 


gccgcgtcag 


gccqtaccca 




gtggttggag 


9780 


cagggccagg tcggccacga 


cgcgctcggc 


cagcacggcc 


*y '***yy**»»yc 


gggtgagggt 


9840 


gtcctgaaag tcgtcgagat 


ccacaaagcg 


Qtoatacaca 


ccagtgccga 


tggtgtaggt 


9900 


gcagttgctc atgacggacc 


agtttacggt 


ctoaatacca 


^yycccacgg 


tttccaggta 


9960 


gcggagacgc gagtaggccc 


gcgtctcgaa 




LugcagyCCC 


gcagcaggta 


10020 


ctggtagccc accagcagat 


gcqqcqqcqq 




aggggccacc 


gctgggtggc 


10080 


gggggcgttg ggggcgagat 


cttccaacat 




tagcegcaga 


tgtagegega 


10140 


catccaagtg atgccgctgg 


ccqtqqtqct 


yy u y^yyy^y 


tagtcgegaa 


cgcggttcca 


10200 


gatgtttcgc agcggctgga 


aqtactcaat 


yy c y yyy cga 


ctctgccccg 


tgaggeggge 


10260 


gcagtcggcg atgctctacg 


qqqaaaaaqa 


a yyy^ ta ytg 


aacaaccgcc 


ttccgtagcc 


10320 


ggaggagaac gcaagggggt 


caaaqaccac 


^-y »yy wtcyy 


gttcgaaacc 


cgggtggcgg 


10380 


cccgaatacg gagggcggtt 


ttttgctttt 


1 1 c t era a a t- « 


cat cccgtgc 


tgeggcagat 


10440 


gcgtccgaac gcggggtccc 


agtccccqqc 


33 »»y tycy 


geegtgaegg 


cggcttctac 


10500 


ggccacgtcg cgctccaccc 


cgcctaccac 




gcggtggctc 


tgcgcggcgc 


. 1O560 


aggggaaccc gaagcagagg 


cqqtqttqqa 


cat* oaarinAn 
*-y »-yyayyag 


ggccaggggt 


tggctegget 


10620 


gggggccctg agtcccgagc 


ggcacccqcq 


^3 L yy*-i-^ty 


aagcgcgacg 


eggeggagge 


10680 


gtacgtgccg cggagcaatc 


tqtttcqcqa 




gaggaggecg 


aggagatgeg 


10740 


agacttgcgt tttcgggcgg 


ggaqqqaqtt 




ctggaccggc 


agagggttct 


1O800 


gagagaggag gactttgagg 


cggacqaqcq 


vav *yyyyyty 


agtcccgcgc 


gggctcacgt 


10860 


ggcggccgcc aacctggtga 


gcqcqtacqa 




aaggaggaga 


tgaacttcca 


10920 


gaagagcttc aatcatcacg 


tQCacacach 


ya c t,g cgege 


gaagaggtgg 


ccatcggcct 


10980 


catgcatctg tgggattttg 


toaaaacata 


vy u tCayaaC 


cccagcagca 


agccgctgac 


11040 


ggctcagctg ttcctcatcg 


tqcaacataa 


v» 3 <* y d ^. ci d C 


gaaacgccca 


gggaggecat 


11100 


gctgaacatt gcagagcctq 




ycccccggac 


ctcattaaca 


tettgeagag 


11160 


tatcgtagtg caggagcgct 


cgctgagcct 


ggccgacaag 


gtggctgcca 


tcaactacag 


11220 


catgctgtcg ctgggcaaat 


tttacgcccg 


caagatctac 


aagtctccgt 


tcgtccccat 


11280 


agacaaggag gtgaagatag 


acagctttta 


catgegcatg 


gegctcaagg 


tgetgactet 


11340 


aagcgacgac ctgggggtgt 


accgcaacga 


ccgcatacac 


aaggcggtga 


gcgccagccg 


11400 



no 



ccggcgcgag 


ctgagcgacc 


gcgagctttt gcacagcctg 


catcgggcgt 


tgactggtgc 


1X460 


cggcagcgcc 


gaggcggccg 


agtactttga cgccggagcg 


gacttgcgct 


ggcagccatc 


11520 


ccgacgcgcg 


ctggaggcgg 


ctggcgtcgg ggagtacggg 


gtcgaggacg 


acgatgaagc 


11580 


ggacgacgag 


ttgggcattg 


acttgtagcc gtttttcgtt 


agatatgtcg 


gcgaacgagc 


11640 


cgtctgcggc 


cgccatggtg 


acggcggcgg gcgcgcccca 


ggacccggcc 


acgcgcgcgg 


11700 


cgctgcagag 


tcagccttcc 


ggagtgacgc ccgcggacga 


ctggtccgag 


gccatgcgtc 


11760 


gcatcctggc 


gctgacggcg 


cgcaaccccg aggcttttcg 


gcagcagccg 


caggcaaacc 


11820 


ggtttgcggc 


cattttggaa 


gcggtggtgc cctccagacc 


caaccccacc 


cacgaaaagg 


11880 


tgctggccat 


cgtcaacgcc 


ctggcggaga ccaaggccat 


ccgcccagac 


gaggccgggc 


11940 


aggtctacaa 


cgcgctgcta 


gaaagggtgg gacgctacaa 


cagctccaac 


gtgcagacca 


12000 


atctggaccg 


ctfcggtgacg 


gacgtgaagg aggccgtagc 


ccagcgagag 


cggtttttca 


12060 


aggaagccaa 


tctgggctcg 


ctggtggccc tcaacgcctt 


cctgagcacg 


ctgccggcga 


12120 


acgtgccccg 


cggtcaggag 


gaccacgtga actttctgag 


cgccctccgc 


ctgatggtgg 


12180 


ccgaggtgcc 


gcagagcgag 


gtgtaccagt ctggccccaa 


ctactacttc 


cagacctccc 


12240 


ggcagggcct 


gcagacggta 


aacctgacgc aggcictttca 


gaacctgcag 


ggcctttggg 


12300 


gggtgcgcgc 


tccgctgggc 


gaccgcagca cggtgtccag 


cctgctgacc 


cccaatgccc 


12360 


ggctgctctt 


gcttctcatt 


gctccgttca ccgacagcgg 


ttccatcagc 


cgcgactctt 


12420 


acctgggaca 


cctgctcacc 


ctgtaccggg aggccatcgg 


gcaggcgcgg 


gtggacgagc 


12480 


agacgtacca 


ggaaatcacc 


agcgtgagcc gcgcgctggg 


gcaggaggac 


acgggcagct 


12540 


tggaggcgac 


tctgaacttc 


ctgctgacca accggcggca 


gcgcctacct 


ccccagtacg 


12600 


cgctgaacgc 


ggaggaggag 


cgcatcctgc gtttcgfcgca 


gcagagcacc 


gcgctgtact 


12660 


tgatgcggga 


aggcgcctct 


cccagcgctt cgctggacat 


gacggcggcc 


aacatggagc 


12720 


catcgttcta 


cgccgccaac 


cgtcccttcg tcaaccggct 


aatggactat 


ttgcatcggg 


12780 


cggcggccct 


gaacccggaa 


tactttacta acgtcatcct 


gaacgaccgt 


tggctgccac 


12840 


ctcccggctt 


ctacacgggg 


gagttcgacc tcccggaggc 


caacgacggt 


ttcatgtggg 


12900 


acgacgtgga 


cagcgtgttc 


ctgcccggca agaaggaggc 


gggtgactct 


cagagccacc 


12960 


gcgcgagcct 


cgcagacctg 


ggggcgaccg ggcccgcgtc 


tccgctgcct 


cgcctgccga 


13020 


gcgccagcag 


cgccagcgtg 


gggcgggtga gccgtccgcg 


cctcagcggt 


gaggaggact 


13080 
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ggtggaacga tccgctgctc cgtccggccc gcaacaaaaa cttccccaac aacgggatag 13140 

aggatttggt agacaaaatg aaccgttgga agacgtatgc ccaggagcat cgggagtggc 13200 

aggcgaggca acccatgggc cctgttctgc cgccctctcg gcgcccgcgc agggacgaag 13260 

acgccgacga ttcagccgat gacagcagcg tgttggatct gggcgggagc gggaacccct 13320 

ttgcccacct gcaacctcgc ggcgtgggtc ggcggtggcg ctaggaaaaa aaattattaa 13380 

aagcacttac cagagccatg gtaagaagag caacaaaggt gtgtcctgct ttcttcccgg 13440 

tagcaaaatg cgtcgggcgg tggcagttcc ctccgcggca atggcgttag gcccgccccc 13500 

ttcttacgaa agcgtgatgg cagcggccac cctgcaagcg ccgttggaga atccttacgt 13560 

gccgccgcga tacctggagc ctacgggcgg gagaaacagc attcgttact cggagctgac 13620 

gcccctgtac gacaccaccc gcctgtacct ggtggacaac aagtcagcag atatcgccac 13 680 

cttgaactac cagaacgacc acagcaactt tctcacgtcc gtggtgcaga acagcgacta 13740 

cacgcccgcc gaagcgagca cgcagaccat taacttggac gaccgctcgc gctggggcgg 13800 

ggacttgaaa accattctgc acactaacat gcccaacgtg aacgagttca tgtttaccaa 13860 

ctcgttcagg gctaaactta tggtggcgca cgaggccgac aaggacccgg tttatgagtg 13920 

ggtgcagctg acgctgccgg aggggaactt ttcagagatt atgaccatag acctgatgaa 13980. 

caacgccatt atcgaccact acctggcggt agccagacag cagggggtga aagaaagcga 14040 

gatcggcgtc aagtttgaca cgcgcaactt tcgtctgggc tgggacccgg agacggggct 14100 

tgtgatgccg ggggtgtaca cgaacgaagc tttccatccc gacgtggtcc tcttgccggg 14160 

ctgcggggtg gactttacct acagccggtt aaacaacctg ctaggcatac gcaagagaat 14220 

gccctttcag gaagggtttc agatcctgta cgaggacctg gagggcggta acatcccggc 14280 

cctgctggac gtgccggcgt acgaggagag catcgccaac gcaagggagg cggcgatcag 14340 

gggcgataat ttcgcggcgc agccccaggc ggctccaacc ataaaacccg ttttggaaga 144 00 

ctccaaaggg cggagctaca acgtaatagc caacaccaac aacacggctt acaggagctg 14460 

gtatctggct tataactacg gcgacccgga gaagggggtt agggcctgga. ccctgctcac 14520 

cactccggac gtgacgtgcg gttcagagca ggtctactgg tcgctgcctg acatgtacgt 14580 

ggaccctgtg acgtttcgct ccacgcagca agttagcaac tacccagtgg tgggagcgga 14640 

gcttatgccg attcacagca agagctttta caacgagcag gccgtctact cacagctcat 14700 

tcgtcagacc accgccctaa cgcacgtttt caaccgcttc cccgagaacc aaatcctagt 14760 

gcgacctcca gcgcccacca tcaccaccgt cagcgagaac gtgcccgctc taaccgatca 14820 
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tgggtggacg acggtgggga cgacgacgcc ccgggtgccg caagtccaga tctccgccac 7140 

ggaggggcgc aggcgttgca ggaggggacg cagctgcccg ctgtccaggg agtcgagggc 7200 

ggccgcgctg aggtcggcgg gaagcgtttg caagttcact ttcagaagac cggtaagagc 7260 

gcgagccagg tgcacatggt acttgatttc caggggggtg ttggaagagg cgtccacggc 7320 

gtagaggagg ccgtgtccgc gcggggccac caccgtgccc cgaggaggtt tcatctcact 7380 

cgtcgagggc gagcgccggg gggtagaggc ggctctgcgc cggggggcag cggaggcagt 7440 

ggcacgtttt cgtgaggatt cggcagcggt tgatgacgag cccggagact gctggcgtgg 7500 

gcgacgacgc ggcggctgag gtcctggatg tgccgtctct gcgtgaagac caccggcccc 7560 

cgggtcctga acctgaaaga gagttccaca gaatcaatgt ctgcatcgtt aacggcggcc 7620 

tgcctgagga tctcctgtac gtcgcccgag ttgtcttgat aggcgatctc ggccatgaac 7680 

tgctccactt cttcctcgcg gaggtcgccg tggcccgctc gctccacggt ggcggccagg 7740 

tcgttggaga tgcgacgcat gagttgagag aaggcgttga ggccgttctc gttccacacg 7800 

cggctgtaca ccacgtttcc gaaggagtcg cgcgctcgca tgaccacctg ggccacgttg 7860 

agttccacgt ggcgggcgaa gacggcgtag tttctgaggc gctggaagag gtagttgagc 7920 
gtggtggcga tgtgctcgca gacgaagaag tacatgatcc agcgccgcag ggtcatctcg . 7980. 

ttgatgtctc cgatggcttc gagacgctcc atggcctcgt agaagtcgac ggcgaagttg 8040 

aaaaattggg agttgcgggc ggccaccgtg agttcttctt gcaggaggcg gatgagatcg 8100 

gcgaccgtgt cgcgcacctc ctgctcgaaa gcgccccgag gcgcctctgc ttcttcctcc 8160 

ggctcctcct cttccagggg cacgggttcc tccggcagct ctgcgacggg gacggggcgg 8220 

cgacgtcgtc gtctgaccgg caggcggtcc acgaagcgct cgatcattcc gccgcgccgg 8280 

cgacgcatgg tctcggtgac ggcgcgtccg ttttcgcgag gtcgcagttc gaagacgccg 8340 

ccgcgcagag cgcccccgtg cagggagggt aagtggttag ggccgtcggg cagggacacg 8400 

gcgctgacga tgcattttat caattgctgc gtaggcactc cgtgcaggga tctgagaacg 8460 

tcgaggtcga cgggatccga gaacttctct aggaaagcgt ctatccaatc gcagtcgcaa 6520 

ggtaagctga ggacggtggg ccgctggggg gcgtccgcgg gcagttggga ggtgatgctg 8580 

ctgatgatgt aattaaagta ggeggtcttc aggcggcgga tggtggcgag gaggaccacg 864 0 

tctttgggcc cggcctgttg aatgcgcagg cgctcggcca tgccccaggc ctcgctctga 8700 

cagcgacgca ggtctttgta gtagtcttgc atcagtctct ccaccggaac ctctgcttct 8760 

cccctgtctg ccatgcgagt cgagccgaac ccccgcaggg gctgcagcaa cgctaggtcg 8820 
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gccacgaccc tctcggccag cacggcctgt tggatctgcg tgagggtggt ctggaagtcg 8880 

tccaggtcca cgaagcggtg ataggccccc gtgttgatgg tgtaggtgca gttggccatg 8940 

acggaccagt tgacgacttg catgccgggt tgggtgatct ccgtgtactt gaggcgcgag 9000 

taggcgcggg actcgaacac gtagtcgttg catgtgcgta ccagatactg gtagccaacc 9060 

aggaagtggg gaggcggttc tcggtacagg ggccagccga ctgtggcggg ggcgccgggg 9120 

gacaggtcgt ccagcatgag gcgatggtag tggtagatgt agcgggagag ccaggtgatg 9180 

ccggccgagg tggtcgcggc cctggtgaat tcgcggacgc ggttccagat gttgcgcagg 9240 

gggcgaaagc gctccatggt gggcacgctc tgccccgtga ggcgggcgca atcttgtacg 9300 
ctctagatgg aaaaaagaca gggcggtcat cgactccctt ccgtagctcg gggggtaaag 9360 
tcgcaagggt gcggcggcgg ggaaccccgg ttcgagaccg gccggatccg ccgctcccga 9420 
tgcgcctggc cccgcatcca cgacgtccgc gtcgagaccc agccgcgacg ctccgcccca 9480 
atacggaggg gagtcttttg gtgttttttc gtagatgcat ccggtgctgc ggcagatgcg 9540 
acctcagacg cccaccacca ccgccgcggc ggcagtaaac ctgagcggag gcggtgacag 9600 
ggaggaggag gagctggctt tagacctgga agagggagag gggctggccc ggctgggagc 9660 
gccgtcccca gagagacacc ctagggttca gctcgtgagg gacgccaggc aggcttttgt 9720 
gccgaagcag aacctgttta gggaccgcag cggtcaggag gcggaggaga tgcgcgattg 9780 
caggtttcgg gcgggtagag agctgagggc gggcttcgat cgggagcggc tcctgagggc 9840 
ggaggatttc gagcccgacg agcgttctgg ggtgagcccg gcccgcgctc acgtctcggc 9900 
ggccaacctg gtgagcgcgt acgagcagac ggtgaacgag gagcgcaact tccaaaagag 9960 

ctttaacaat cacgtgagga ccctgatcgc gagggaggag gtgaccatcg ggctgatgca 10020 

tctgtgggac ttcgtggagg cctacgtgca gaacccggcc agcaaacctc tgacggccca 10080 

gctgttcctg atcgtgcagc acagccgcga caacgagacg ttccgcgacg ccatgttgaa 10140 

catcgcggag cccgagggtc gctggctctt ggatctgatt aacatcctgc agagcatcgt 10200. 

ggtgcaggag aggggcctca gdttagcgga caaggtggcg gccattaact attcgatgca 10260 

gagcctgggg aagttctacg ctcgcaagat ctacaagagc ccttacgtgc ccatagacaa 10320 

ggaggtgaag atagacagct tttacatgcg catggcgctg aaggtgctga cgctgagcga 10380 

cgacctcggc gtgtaccgta acgacaagat ccacaaggcg gtgagcgcca gccgccggcg 10440 

ggagctgagc gacagggagc tgatgcacag cctgcagagg gcgctggcgg gcgccgggga 10500 
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cgaggagcgc gaggcttact 


tcgacatggg 


agccgatctg 


cagtggcgtc 


ccagcgcgcg 


10560 


cgccttggag gcggcgggct 


accccgacga 


ggaggatcgg 


gacgatttgg 


aggaggcagg 


10620 


cgagtacgag gacgaagcct 


gaccgggcag 


gtgttgtttt 


agatgcagcg 


gccggcggac 


10680 


ggggccaccg cggatcccgc 


acttttggca 


tccatgcaga 


gtcaaccttc 


gggcgtgacc 


10740 


gcctccgatg actgggcggc 


ggccatggac 


cgcattatgg 


cgctgactac 


ccgcaacccc 


10800 


gaggctttta gacagcaacc 


ccaggccaac 


cgtttttcgg 


ccatcttgga 


agcggtggtg 


10860 


ccctcccgca ccaaccccac 


acacgagaaa 


gtcctgacta 


tcgtgaacgc 


cctggtagac 


10920 


agcaaggcca tccgccgcga 


cgaggcgggc 


ttgatttaca 


acgctctgct 


ggaacgggtg 


10980 


gcgcgctaca acagcactaa 


cgttcagacc 


aatctggatc 


gcctcaccac 


cgacgtgaag 


11040 


gaggcgctgg ctcagaagga 


gcggtttctg 


agggacagca 


atctgggctc 


tctggtggca 


11100 


ctcaacgcct tcctgagcac 


gcagccggcc 


aacgtgcccc 


gcgggcagga 


ggactacgtg 


11160 


agcttcatca gcgctctgag 


gctgctggtg 


tccgaggtgc 


cccagagcga 


ggtgtatcag 


11220 


tctgggccgg attacttctt 


ccagacgtcc 


cgacagggct 


tgcaaacggt 


gaacctgact 


11280 


caggccttta aaaacttgca 


aggcatgtgg 


ggcgttaagg 


ccccggtggg 


cgatcgagcc 


11340 


accatctcca gtctgctgac 


ccccaacact 


cgcctgctgc 


tgctcttgat 


cgcgccgttc 


.11400 


accaacagta gcactatcag 


ccgtgactcg 


tacctgggtc 


atctcatcac 


tttgtaccgc 


11460 


gaggccatcg gtcaggctca 


gatcgacgag 


cacacatatc 


aggagatcac 


taacgtgagc 


11520 


cgggccctgg gtcaggaaga 


taccggcagc 


ctggaagcca 


cgttgaactt 


tttgctaacc 


11580 


aaccggaggc aaaaaatacc 


ctcccagttt 


acgttaagcg 


ccgaggagga 


gaggattctg 


11640 


cgatacgtgc agcagtccgt 


gagtctgtac 


ttgatgcggg 


agggcgccac 


cgcttccacg 


11700 


gctttagaca tgacggctcg 


gaacatggaa 


ccgtcctttt 


actccgccca 


ccggccgttc 


11760 


attaaccgtc tgatggacta. 


cttccatcgc 


gcggccgcca 


tgaacgggga 


gtacttcacc 


11820 


aatgccatcc tgaatccgca 


ttggatgccc 


ccgtccggct 


tctacaccgg 


cgagtttgac 


11880 


ctgcccgaag ccgacgacgg 


ctttctttgg 


gacgacgtgt 


ccgacagcat 


tttcacgccg 


11940 


ggcaatcgcc gattccagaa 


gaaggagggc 


ggagacgagc 


tccccctctc 


cagcgtggag 


12000 


gcggcctcta ggggagagag 


tccctttccc 


agtctgtctt 


ccgccagcag 


tggtcgggta 


12060 


acgcgcccgc ggttgccggg 


ggagagcgac 


tacctgaacg 


accccttgct 


gcggccggct 


12120 


aggaagaaaa atttccccaa 


caacggggtg 


gaaagcttgg 


tggataaaat 


gaatcgttgg 


12180 


aagacctacg cccaggagca 


gcgggagtgg 


gaggacagtc 


agccgcgacc 


gctggttccg 


12240 
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ccgcactggc gtcgtcagag agaagacccg gacgactccg cagacgatag tagcgtgttg 1230O 

gacctgggag ggagcggagc caaccccttt gctcacttgc aacccaaggg gcgttccagt 12360 

cgcctctact aataaaaaag acgcggaaac ttaccagagc catggccaca gcgtgtgtcc 12420 

tttcttcctc tctttcttcc tcggcgcggc agaatgagaa gagcggtgag agtcacgccg 12480 

gcggcgtatg agggtccgcc cccttcttac gaaagcgtga tgggatcagc gaacgtgccg 12 540 

gccacgctgg aggcgcctta cgttcctccc agatacctgg gacctacgga gggcagaaac 12600 

agcatccgtt actccgagct ggcacccctg tacgatacca ccaaggtgta cctggtggac 12660 

aacaagtcgg cggacatcgc ctccctgaat tatcaaaacg atcacagcaa ttttctgact 12720 

accgtggtgc agaacaatga cttcaccccg acggaggcgg gcacgcagac cattaacttt 12780 

gacgagcgtt cccgctgggg cggtcagctg aaaaccatcc tgcacaccaa catgcccaac 12840 

atcaacgagt tcatgtccac caacaagttc agggccaggc tgatggttaa aaaggctgaa 12900 

aaccagcctc ccgagtacga atggtttgag ttcaccattc ccgagggcaa ctattccgag 12960 

accatgacta tcgatctgat gaacaatgcg atcgtggaca attacctgca agtggggagg 13020 

cagaacgggg tattggaaag cgatatcggc gtaaaatttg ataccagaaa cttccgactg 13080 

gggtgggatc ccgtgaccaa gctggtgatg ccaggcgtgt acaccaacga ggcttttcac 13140 

cccgacatcg tgctgctgcc ggggtgcggt gtggacttca ctcagagccg tttgagtaac 13200 

ctgttaggga tcagaaagcg ccgccccttc caagagggct ttcagatcat gtatgaggac 13260 

ctggaaggag gtaacattcc aggtttgcta gacgtgccgg cgtatgaaga gagtgttaaa 13320 

caggcggagg cgcagggacg agagattcga ggcgacacct ttgccacgga acctcacgaa 13380 

ctggtaataa aacctctgga acaagacagt aaaaaacgga gttacaacat tatatccggc 1344 0 

actatgaata ccttgtaccg gagctggttt ctggcttaca actacgggga tcccgaaaag 13500 

ggagtgagat catggaccat actcaccacc acggacgtga cctgcggctc gcagcaagtg 13560 

tactggtccc tgccggatat gatgcaagac ccggtcacct tccgcccctc cacccaagtc 13620 

agcaacttcc cggtggtggg caccgagctg ctgcccgtcc atgccaagag cttctacaac 13680 

gaacaggccg tctactcgca actcattcgc cagtccaccg cgcttaccca cgtgttcaat 13740 

cgctttcccg agaaccagat tctggtgcgc cctcccgctc ctaccattac caccgtcagt 13800 

gaaaacgttc ccgccctcac agatcacgga accctgccgc tgcgcagcag tatcagtgga 13860 

gttcagcgcg tgaccatcac cgacgccaga cgtcgaacct gtccctacgt ttacaaagct 13920 
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ctt 9g c gtag Cgg c t cc taa agtg c t c tCt a gtCg cacct t ctaaacat g tccatcctca 139eo 

tc C c t ccc g a taacaacacc ggctgg9gac tgggctccgg caagatgtac ggcggagcca ^ 

aaa ggcg c tc ca 9 tca g cac cc agttcgag t tc g939 cca cttccgtgct occ tgg99ag M 100 

cttacaa 9C9 a 9g actc tCg 99 cc gaacgg c 9gta9acga t acca tagat g cc gtgattg 14160 

cc 9 ac 9 ccc 9 cc 9gt acaac cc C9gaccgg Ccgctagcgc cgcctccacc gtggafctccg ^ 

tg a t c 3 aca 9 c 9tg9tagct ggcgctcggg cctatgctcg ccgcaagagg cggctgcatc ^ 

gaagacgtcg ccccaccgcc g ccat gctgg ca g cca 999 c c gtg c tgagg c ggg ccc gga a4340 

g 99t ag9 ca g aa 9ggctatg c 3 cc 9 c g ct g cc 9 ccaac g c c g cc g cc 999 agggcccgcc 14400 

9aCa " Ct9C CC3CC ^ Ct -ctgcegcc. tc g c t a g cat gg cc agaC cc a 93 a 9 a 93ga 144fi0 

ac 9tgtactg 33t9 c gcgat tc tgtgacgg gagtccgagt 9 cc ggtg c gc a9 cc ga cc t c i 45 20 

ccc 9aagtta 9 aa 9 a t ccaa 3 c t3 c gaaga c gg c 99 tac t gagtctccct gttgttatca 1458Q 

g cccaaca tg agcaagcg c a agtttaaaga agaactgctg c agacg c tgg tg cc tgagat 14640 

c tat9g ccc t cc 9g ac 3t3 a a g cc tg acat ta a g cccc 9 c 3 a tatC aa 3 c 3 t 3tt aaaa a 14700 

3 c 9ggaaaag aaagaggaac tcgcggtggt agacgatggc ggagtggaat ttattaggag 1476Q 

t C tC 9 cccc g c 9acg c a9gg ttcaatggaa agggcggcgg gtacaacgcg ttttgaggcc 14B20 

999cacc 9 c g 9tagttttta ccccgggaga gcggtcggcc gttaggggtt tcaaaaggca l4880 

gt ac 9 a= gag gtgtacggcg acgaggacat attggaacag gcggctcaac agatcggaga 1494Q 

att t9 cc ta c 99aaagcgtt c 9 c 9Ccg c ga a9aC c t99 cc a tc 9 c tttag acag c ggcaa 15 ooo 

ccc cacgcC c ag cctc aaac c tgt3acgct gcagcaggtg ctccccgtga gcgccagcac 1506Q 

gg ac agcaag aggggaataa aaagagaaat ggaagatctg cagcccacca tccagctcat 1512Q 

g9 tccc taaa c 9gcagaggc tggaagaggt cctggagaaa atgaaagtgg acccaagcafc 15leo 

a 9a9 cc 99a c 9tcaaagtca g9 cc 9atcaa agaagtggcc cctggtctcg gggtgcagac 15240 

9 3t33 a tat c c a3atC ccc 3 tc ac3tcag c ttcgaC c g cc 9tggaag cc a tggaaacgca X5300 

a ac 9gaaaC c cct 3 cc 9cga tc ggtaccag ggaagtgg c 3 tt 9 c aaa cc 9 acccctggta 153 6d 

c 9aa t a c 9 cc gcccctcggc 9t c agagg c g .cccgctcgt t.cggccccg cc aa c 9 cc at 15420 

cat 9 cc agaa tatgcgctgc atccgtctat cctgcccacc cccggctacc ggggagtgac 154eo 

9tatc g ccc 9 tca 9gaac cc 9 ccgcc 3 aac cc gt c 9 cc 9 c c 3 cc 3 ctccc 9 tc 9tg ctc t 1554 0 

99ccccc 9tg tc ggCgcg cc g c gtaacacg cc gg99aaag a c agttacca ttcccaaccc X5600 

g c g ctaccac cc tagca tcc ttt aatgact ctgccgtttt 3 c a3atgg ct c tgac tt 9 cc 156e0 
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gcgtgcgcct tcccgttccg 


cactatcgag 


gaaqatctca 


^^ytaggaga 


ggcatggcgg 


15720 


gtagtggtcg ccggcgggct 


ttgcgcaggc 


QcatQaaaaa 


LyyadttLCa 


cccgctctga 


15780 


tacccataat cgccgccgcc 


atcggtgcca 


tacccoorah 


cgccxcagcg 


gecttgeaag 


15840 


cagctcgcaa taaataaacg 


aaqqcttttcr 


cart r- a f- rit- 


ctggtcctga 


ctattttatg 


15900 


cagaaagagc atggaagaca 


tcaattttac 


Cf t*. car* h nrtr* ♦» 


ccgcggcacg 


gctcgcggcc 


15960 


gctcatgggc acctggaacg 


acatcggcac 




aacgggggcg 


ctttcaattg 


16020 


ggggagcctt tggagcggca 


ttaaaaact t 


f~ ("—J i^T ¥~ m f% 

*-yyct.ccacg 


at taaatcct 


aeggcagcaa 


1608O 


agcccggaac agtagtgctg 


gtcaqatact 




ctgaaggaca 


ccaacttcca 


16140 


agaaaaagtg gtcaatgggg 


tqataaccao 


v»an.Cd Cy y C 


geggtagate 


tcgccaacca 


16200 


agcggtgcag aaagagattg 


acaqacattt 


yy aa.agcc.cg 


c g3gtgccgc 


cgcagagagg 


16260 


ggatgaggtg gaggtcgagg 


aaqtaaaaat 


rt y a 99 a ^aag 


ctgcccccgc 


tggagaaagt 


16320 


tcccggtgcg cctccgagac 


cacaoaaaro 


J3 f* ^ O ^ i*^«T ft ^ 

oLccoygcca 


gaactagaag 


aaaccctggt 


16380 


gacggagagc aaggagcctc 




geaagectxg 


aaagagggcg 


cctctccacc 


16440 


ctacccaatg acaaaaccga 




ggctcggccg 


gtgtacggga 


aggactacaa 


16500 


gcctgtcacg ctagagctcc 


CCCcdccorr 


accgccgccc 


cccacgcgcc 


cgaccgttcc 


16560 


cccccccctg ccggctccgt 


cqqcqQqace 




cccgtcgccg 


tgcctctgcc 


16620 


agccgcccgc ccagtggccg 


tqqccactac 


faoaaa r*> 
v-ayaaatCCC 


a 9 a ggccaga 


gaggagecaa 


16680 


ctggcaaagc acgctgaaca 


gcatcqtqqq 


^-v-^-yyyay i»y 


aaaagcctga 


aacgccgccg 


16740 


ctgctattat taaaagtgta 


gc taaaaaa t 




atacgcctcc 


tatgttaccg 


16800 


ccagagacgc gtgactgtcg 


CCQCOaorar 


LgcLCLcaag 


atggccaccc 


catcgatgat 


16860 


gccgcagtgg tcttacatgc 


wo wv<Mt v. y y 


gcaggacgcc 


teggagtace 


tgagccccgg 


16920 


tctcgtgcag ttcgcccgcg 




ctacttcagc 


ttgggaaaca 


agtttagaaa 


16980 


ccccaccgtg gcccccaccc 


acaatataar 


cacggaccgc 


tegcaaagge 


tgaccctgcg 


17040 


ttttgtgccc gtagaccqqq 




gracccttac 


aaagtgcgct 


acacgctggc 


17100 


cgtaggggac aaccgagtgc 


tggacatggc 


cagcacctac 


tttgacatcc 


ggggagtgct 


17160 


ggatcgcggt cccagtttta 


agccctactc 


gggtaccgcg 


tacaattccc 


tggctcccaa 


17220 


gggcgctccc aaccctgcag 


aatggacgaa 


ttcagacagc 


aaagttaaag 


tgagggcaca 


17280 


ggcgcctttt gttagctcgt 


at 9gtgctac 


agegattaca 


aaagagggta 


ttcaggtggg 


17340 
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agtaacctta acagactccg gatcaacacc acagtatgca gataaaacgt atcagcctga 17400 

gccgcaaatt ggagaactac agtggaacag cgatgttgga accgatgaca aaatagcagg 17460 

aagagtgcta aagaaaacaa cgcccatgtt cccttgttac ggctcatatg ccaggcccac 17520 

taatgaaaaa ggaggacagg caacaccgtc cgctagtcaa gacgtgcaaa atcccgaatt 17580 

acaatttttt gcctctacta atgtcgccaa tacaccaaaa gcagttctat atgcggagga 17640 

cgtgtcaatt gaagcgccag acactcactt gg tgttC aaa ccaacagtca ctgaaggcat 17700 

tacaagttca gaggctctac tgacccaaca agctgctccc aaccgtccaa actacatagc 17760 

ctttagagat aattttattg gtctcatgta ctacaatagc acaggtaaca. tgggagtact 17820 

ggcaggccag gcttctcagc taaatgcagt tgttgacctg caagacagaa atactgagct 17880 

gtcctaccaa ctcatgttgg acgccctcgg agaccgcagt cggtactttt ctatgtggaa 17940 

ccaagctgtg gatagttacg atcctgatgt aagaatcata gaaaaccatg gcgtagaaga 18000 

tgaattgcct aattattgct ttcctttggg aggcatggca gtaaccgaca cctactcgcc 18060 

tataaaggtt aatggaggag gcaatggatg ggaagccaat aacggcgttt tcaccgaaag 18120 

aggagtggaa ataggttcag ggaacatgtt tgccatggag attaacctgc aagccaacct 18180 

atggcgtagc tttctgtact ccaatattgg gctgtacctg ccagactctc tcaaaatcac 18240 

tcctgacaac atcacactcc cagagaacaa aaacacctat cagtatatga acggtcgcgt 18300 

gacgccaccc gggctggttg acacctacgt taacgtgggc gcgcgctggt cccccgatgt 18360 

catggacagt attaaccctt ttaatcacca ccgcaacgcc ggactccgct accgttccat 18420 

gctcctggga aacggacgct acgtgccctt ccacatccag gtgccccaga aattctttgc 18480 

aattaaaaac ctgctgctgc tccccggttc ctacacctac gagtggaact tccgcaagga 18540 

cgtgaacatg atcttgcaga gctcgctggg caatgacctg cgagtggacg gggccagcat 18600 

ccgcttcgac agcatcaacc tgtacgccaa ctttttcccc atggcccaca acacggcctc 18660 

caccctggaa gccatgctgc gcaacgacac caacgaccaa tctttcaacg actacctgtg 18720 

cgcggccaac atgctgtacc ccatccccgc caacgccacc agcgtgccca tctccattcc 18780 

ctctcgcaac tgggcagcct tcaggggctg gagtttcacc cgcctcaaaa ccaaggagac 18840 

cccctcgctg ggctccgggt tcgaccccta cttcgtctac tccggctcca tcccctacct 16900 

ggacggcacc ttctacctca accatacttt caaaaaggtg tcaatcatgt tcgactcctc 18960 

cgtcagctgg cccggcaacg accgtctgct gacgcccaac gagttcgaaa tcaagcgttc 19020 

9gtggacggt gaagggtaca acgtggctca gagcaacatg accaaggact ggttcctgat 19080 
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tcagatgctc agccactaca acatcggcta ccagggcttc tacgtgcccg aaaattacaa 19140 

ggaccgcatg tactctttct tcagaaactt ccaacccatg agccgccaaa ttgtagattc 19200 

aacggcttac actaattatc aggatgtgaa actgccatac cagcataaca actcagggtt 19260 

cgtgggctac atgggaccca ccatgcgaga ggggcaggcc tacccggcca actatcccta 19320 

tcccctgatt ggggccaccg ccgtgcccag cctca'cgcag aaaaagttcc tctgcgaccg 19380 

ggtgatgtgg aggatcccct tctctagcaa cttcatgtct atgggctccc tcaccgacct 19440 

ggggcagaac atgctgtacg ccaactccgc tcacgccttg gatatgacct ttgaggtgga 19500 

tcccatggat gagcccacgc ttctctatgt tctgtttgaa gtcttcgacg tggtgcgcat 19560 

ccaccagccg caccgcggcg tcatcgaggc cgtctacctg cgcacacctt tctctgccgg 19620 

taacgccacc acctaaagaa gccgatgggc tccagcgaac aggagctgca ggccattgtt 19680 

cgcgacctgg gctgcgggcc ctactttttg ggcaccttcg acaagcgttt tcccggcttc 19740 

atgtcccccc acaagccggc ctgtgccatc gttaacacgg ccggacggga gaccgggggg 19800 

gtccactggc tcgccttcgc ctggaacccg cgtaaccgca cctgctacct gttcgaccct 19860 

tttggtttct ccgacgaaag gctgaagcag atctaccagt tcgagtacga ggggctcctc 19920 

aagcgcagcg ctctggcctc cacgcccgac cactgcgtca ccctggaaaa gtccacccaa 19980. 

acggtccagg ggcccctctc ggccgcctgc gggctcttct gttgcatgtt tttgcacgcc 20040 

ttcgtgcact ggcctcacac ccccatggat cacaacccca ccatggatct gctcaccgga 20100 

gtgcccaaca gcatgcttca cagcccccag gtcgccccca ccctgcgccg taaccaggaa 20160 

cacctgtatc gctttctggg gaaacactct gcctattttc gccgccaccg gcagcgcatc 20220 

gaacgggcca cggccttcga aagcatgagc caaagagtgt aatcaataaa aaacattttt 20280 

atttgacatg atacgcgctt ctggcgtttt attaaaaatc gaagggttcg agggaggggt 20340 

cctcgtgccc gctggggagg gacacgttgc gatactggaa acgggcgctc caacgaaact 20400 

cggggatcac cagccgcggc aggggcacgt cttctaggtt ctgcttccaa aactgccgca 20460 

ccagctgcag ggctcccatg acgtcgggcg ccgatatctt gaagtcgcag ttagggccgg 20520 

agctcccgcg gctgttgcgg aacacggggt tggcacactg gaacaccagc acgccggggt 2O580 

tgtggatact ggccagggcc gtcgggtcgg tcacctccga cgcatccaga tcctcggcgt 20640 

tgctcagggc aaacggggtc agcttgcaca tctgccgccc aatctggggt actaggtcgc 20700 

gcttgttgag gcagtcgcag cgcagaggga tcaggatgcg tcgctgcccg cgttgcatga 20760 
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cagggcaact cgccgccagg 


aactcctcca 


tttgacggaa 


ggccatctgg 


gctttgccgc 


20820 


ccccggcgca gaatagcccg 


caggacttgc tagagaatac gttatgaccg 


cagttgacgt 


20880 


tcLt-cgcgca gcagcgggcg 


tcttcgttct 


tcagctgaac cacgttgcgg 


ccccaacggt 


20940 


cccggaccac cttggctcta 


gtggggtgct 


ccttcagcgc ccgctgtccg 


ttctcgctgg 


21000 


ttacatccat ttccaacacg 


tgctccttgc 


agaccatctc 


cactccgtgg 


aagcaaaaca 


21060 


ggacgccctc ctgctgggta 


ctgcgatgct 


cccatacggc gcatccggtg 


ggctcccagc 


21120 


tcttgtgttt tacccccgcg 


taggcttcca 


tgtaagccat 


aaggaatctg 


cccatcagct 


21180 


cggtgaaggt cttctggttg 


gtgaaggtta 


gcggcaggcc 


gcggtgctcc 


tcgttcaacc 


21240 


aagtttgaca gatcttgcgg 


tacaccgctc 


cctggtcggg 


cagaaactta 


aaagccgctc 


21300 


tgctgtcgtt gtctacgtgg 


aacttctcca 


ttaacatcat 


catggtttcc 


atacccttct 


21360 


cccacgctgt caccagtggt 


ttgctgtcgg 


ggttcttcac 


caacacggcg 


gtagaggggc 


21420 


cctcgccggc cccgacgtcc 


ttcatggtca 


ttctttgaaa 


ctccacggag 


ccgtccgcgc 


21480 


gacgtactct gcgcaccgga 


gggtagctga 


agcccacctc 


caccacggtg 


ccttcgccct 


21540 


cgctgtcgga gacaatctcc 


ggggatggcg 


gcggcgcggg 


tgtcgccttg 


cgagccttct 


21600 


tcttgggagg gagctgaggc 


gcctcctgct 


cgcgctcggg 


gctcatctcc 


cgcaagtagg 


21660 


gggtaatgga gctgcctgct 


tggttctgac 


ggttggccat 


tgtatcctag 


gcagaaagac 


21720 


atggagctta tgcgcgagga 


aactttaacc 


gccccgtccc 


ccgtcagcga 


cgaagatgtc 


21780 


atcgtcgaac aggacccggg 


ctacgttacg 


ccgcccgagg 


atctggaggg 


gcctgaccgg 


21840 


cgcgacgcta gtgagcggca 


ggaaaatgag 


aaagaggagg 


cctgctacct 


cctggaaggc 


21900 


gacgttttgc taaagcattt 


cgccaggcag 


agcaccatag 


ttaaggaggc 


cttgcaagac 


21960 


cgctccgagg tgcccttgga 


cgtcgccgcg 


ctctcccagg 


cctacgaggc 


gaaccttttc 


22020 


tcgcctcgag tgcctccgaa 


gagacagccc 


aacggcacct 


gcgagcccaa 


cccgcgactc 


22080 


aacttctacc ccgtgttcgc 


cgtaccagag 


gcgctggcca 


cctatcacat 


ttttttcaaa 


22140 


aaccaacgca tccccctatc 


gtgccgggcc 


aaccgcaccg 


cggccgatag 


gaatctcagg 


22200 


cttaaaaacg gagccaacat 


acctgatatc 


acgtcgctgg 


aggaagtgcc 


caagattttc 


22260 


Qagggtctgg gtcgagatga 


gaagcgggcg gcgaacgctc 


tgcagaaaga 


acagaaagag 


22320 


agtcagaacg tgctggtgga 


gctggagggg gacaacgcgc 


gtctggccgt 


cctcaaacgc 


22380 


tgcatagaag tctcccactt 


cgcctacccc 


gccctcaact 


tgccacccaa 


agttatgaaa 


22440 


tcggtcatgg atcagctgct 


catcaagaga gctgagcccc 


tggatcccga 


ccaccccgag 


22500 
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gcggaaaact 


cagaggacqcr aaaoccenf r ahranr-rta^rt 


a 9gagctcga 


gcggtggctg 


22560 


gaaaccaggg 


aCCCCCaaca Qttaraaoan annrn^a 

3 ^^ymoyay oHgcgcaaga 


tgatgatggc 


ggccgtgctg 


22620 


gtcaccgtgg 


aqcfcqqaatQ cctacaa^oo t- f- *- »- 


acgtggagac 


getaegcaaa 


22660 


atcggggaat 


v j vu ^k« i * ^aLi,uLv>i.yc cagggccacg 


tccgccaggc 


ctgeaagate 


22740 


tccaacgtgg 


» 3 ^uv»yv.oa ^-v-tggccccc cacacgggca 


tcctccacga 


gaaccggctg 


22800 


gggcaqagcq 

4m* mJ J J J *7 


, -3 wv -y^oLLg cacccugcaa ggcgaggcgc 


ggegggacta 


cgtgcgagac 


22860 


tgcatctacc 


tuLLLcicac ccccacccgg cagaccgcca 


tgggcgtctg 


gcagcagtgc 


22920 


ttqqaaqaqa 


yaadtctcaa agagccagac aaactcctct 


gccgccagcg 


gcgcgccctg 


22980 


tggtccggtt 


^-v-aytyetycg cacggccgcc agcgctctgg 


eggacatcat 


cttcccggag 


23040 


cgcctgatga 


aaaccttgca aaacggcctg ccggatttca 


tcagtcaaag 


cattttgeaa 


23100 


aacttccgct 


vwwvwyt^i, yy»»tyctcc gggaccccgc 


ccgccatgag 


ctgcgcgcta 


23160 


ccttctgact 


^3*-«-v-v,v».^ t. Cwccuaccyc gagcgcccfcc 


ccccactgtg 


gagccactgc 


23220 


tacctcttcc 


»w^vyyuLdd etccctggcc caccactccg 


acctcatgga 


agaegtaage 


23280 


Qqaqaqqqtt 


tactggagtg ccactgccgc tgcaacctgt 


gcacccccca 


cagategctg 


23340 


gcctgcaaca. 


^'-yay^tdct cagcgaaacc ■ caggtcatag 


gtaccttcga 


gatccagggg 


23400 


ccccaacaac 


adgagggege ttccggcttg aagctcactc 


cggcgctgtg 


gacctcggct 


23460 


tacttacgca 


aaLLtytagc cgaggactac cacgcccaca 


aaattcagtt 


ttacgaagac 


23520 


caatctcgac 


to^Lydaagc cccccccacg gcctgcgtca 


tcacccagag 


caagatcctg 


23580 


gcccaattgc 


aatccatcaa ccaagcgcgc cgcgatttcc 


ttttgaaaaa 


gggtcggggg 


23640 


gtgtacctqq 


av,\.uv.i.etydg eggegaggaa ctcaacccgt 


ccacactctc 


egtcgaagea 


23700 


gcccccccga 


gacatgccgc ccaagggaac cgccaagcag 


ctgatcgctc 


ggcagagagc 


23760 


gaagaagcaa 


yayv»tyLtgg agcagcaggc ggaggacgag 


gaagagatgt 


gggacageca 


23820 




gtgtcagagg acgaggagga gatggaaagc 


tgggacagee 


tagacgagga 


23880 


ygaggacgag 


ctttcagagg aagaggegae cgaagaaaaa 


ccacctgcat 


ccagcgcgcc 


23940 


ttctctgagc 


cgacagccga agccccggcc cccgacgccc 


ccggccggct 


cactcaaagc 


24000 


cagccgtagg 


tgggacgeca ccgaatctcc ageggcageg 


geaaeggcag 


egggtaagge 


24060 


caaacgcgag 


eggegggggt attgetcctg gcgggcccac 


aaaagcagta 


ttgtgaactg 


24120 


cttgcaacac 


tgcgggggaa acatctcctt tgcccgacgc 


tacctcctct 


tccatcacgg 


24180 
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tgtggccttc cctcgcaacg ttctctatta ttaccgtcat ctctacagcc cctacgaaac 24240 

gctcggagaa aaaagctaag gcctcctccg ccgcgaggaa aaactccgcc gccgctgccg 24300 

ccgccaagga tccaccggcc accgaagagc tgagaaagcg catctttccc actctgtatg 24360 

ctatctttca gcaaagccgc gggcagcacc ctcagcgcga actgaaaata aaaaaccgct 24420 

ccttccgctc gctcacccgc agctgtctgt accacaagag agaagaccag ctgcagcgca 24480 

ccctggacga cgccgaagca ctgttcagca aatactgctc agcgtctctt aaagactaaa 24540 

agacccgcgc tttttccccc tcggccgcca aaacccacgt catcgccagc atgagcaagg 24600 

agattcccac cccctacatg tggagctatc agccccagat gggcctggcc gcgggggccg 24660 

cccaggacta ctccagcaag atgaactggc tcagcgccgg cccccacatg atcccacgag 24720 

ttaacggcat ccgagcccac cgaaaccaga ttctcttaga acaggcggca ateaccgcca 24760 

caccccggcg ccaactcaac ccgcctagtt ggcccgccgc ccaggtgtat caggaaaatc 24840 

cccgcccgac cacagtcctc ctgccacgcg acgcggaggc cgaagtcctc atgactaact 24900 

ctggggtaca attagcgggc gggtccaggt acgccaggta cagaggtcgg gccgctcctt 24960 

accctcccgg gagtataaag agggtgatca ttcgaggccg aggtatccag ctcaacgacg 25020 

agacggtgag ctcctcaacc ggtctcagac ctgacggagt cttccagctc ggaggagcgg 25080 

gccgctcttc cttcaccact cgccaggcct acctgaccct gcagagctcc tcctcgcagc 25140 

cgcgctccgg gggaatcggc actctccagt tcgtggaaga gttcgttccc tccgtctact 25200 

tcaacccctt ctccggctcg cctggacgct acccggacgc cttcattccc aactttgacg 25260 

cagtgagtga atccgtggac ggctacgact gatgacagat ggtgcggccg tgagagctog 25320 

gctgcgacat ctgcatcact gccgtcagcc tcgctgctac gctcgggagg cgatcgtctt 25380 

cagctacttt gagctgccgg acgagcaccc tcagggtccg gctcacgggt tgaaactcga 2544 0 

gatcgagaac gcgctcgagt ctcgcctcat cgacaccttc accgcccgac ctctcctggt 25500 

agaaatccaa cgggggatca ctaccatcac cctgttctgc atctgcccca cgcccggatt 25560 

acatgaagat ctgtgttgtc atctttgcgc tcagtttaat aaaaactgaa ctttttgccg. 25620 

caccttcaac gccatctgtg atttctacaa caaaaagttc ttctggcaaa ggtacacaaa 25680 

ctgtatttta ttctaatcct acctcatcta tcgtgctgaa ctgcgcctgc actaacgaac 25740 

ttatccagtg gattgcaaac ggtagtgtgt gcaagtactt ttgggggaac gatatagtta 25800 

gtagaaataa cagcctttgc gagcactgca actcctccac actaatcctt tatcccccat 25860 

ttgttactgg atggtatatg tgcgttggct ccggtttaaa tcctagttgc tttcataagt 25920 
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ggtttctaca aaaagagacc cttcccaaca attc tgtttc ttttttcgcc ctatcctact 25980 

gctgttctcc ctctggttac tc tttC aaac ctctaattgg tattttagct ttgatactca 26040 

taatctttat taactttata ataattaaca acttacagta aacatgcttg ttctactgct 26X00 

cgccacatct ttcgctctct ctcacgccag aacaagtatt gttggcgcag gttacaatgc 26X60 

aactcttcaa tctgcttaca tgccagattc cgaccagata ccccatatta cgtggtactt 26220 

acaaacctcc aaacctaatt cttcatttta tgaaggaaac aaactctgcg atgactccga 26280 

caacagaacg cacacatttc cccacccttc actacaattc gaatgcgtaa acaaaagctt 26340 

gaagctttac aacttaaagc cttcagattc tggcttgtac catgctgtag ttgaaaaaag 26400 

taatttagaa gtccacagtg attacattga attgacggtt gtggacctgc cacctccaaa 26460 

atgtgaggtt tcctcctctt accttgaagt tcaaggcgtg gatgcctact gcctcataca 26520 

cattaactgc agcaactcta aatatccagc tagaatttac tataatggac aggaaagtaa 26580 

t c ttttttat tatttaacaa caagcgctgg taacggtaaa cagttacctg actattttac 26640 

tgc t g ttgtt gaa ttttC ca cctacagaga aacgtatgcc aagcggcctt acaatttctc 26700 

atacccgttt aacgacc ttt gcaatgaaat. acaagcgctc gaaactggaa ctgattttac 26760 

tccaattttcattgctgcca ttgttgtaag cttaattacc attattgtca gcctagcatt 26820 

ttactgcttt tacaagccca aaaaccctaa gt tt gaaaaa cttaaactaa aacctgtcat 26880 

tcaacaagtg tga tttt g tt ttccagcatg gtagctgcat ttctacttct cctctgtcta 26940 

cccatcattt tcgtctcttc aactttcgcc gcagtttccc acctggaacc agagtgccta 27000 

ccgccttttg acgtgtatct gattctcacc t ttg ttt gtt gtatatccat ttgcagtata 27060 

gcctgotttt ttataacaat ctttcaagcc gccgactatt ttC acg t gcg aattgcttac 27X20 

tttagacacc atcctgaata cagaaatcaa aacgttgcct ccttactttg tttggcatga 27X80 

ttaagttatt gctgatactt aactatttac ccctaatcaa ctgtaattgt ccattcacca 27240 

aaccctggtc attctacacc tgttatgata aaatccccga cactcctgtt gcttggcttt 27300 

acgcagccac cgccgctttg gtatttatat ctacttgcct tggagtaaaa ttgtatttta 27360 

ttttacacac tgggtggcta catcccagag aagatttacc tagatatcct cttgtaaacg 27420 

cttttcaatt acagcctctg cctcctcctg atcttcttcc tcgagctccc tctattgtga 27480 

gctactttca actcaccggt ggagatgact gactctcagg acattaatat tagtgtggaa 27540 

agaatagctg ctcagcgtca gcgagaaacg cgagtgttgg aatacctgga actacagcaa 27600 
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cttaaagagt cccac t gg tg tgagaaagga gtgctgtgcc atgttaagca ^ 

t cc t ac gatg tC a gcgttca gggacatgaa ctgccttaca ^ 

acc ttct gca ccat g at ggg ctctacctcc atcacaatca cccaacaagc ^ 

ga gggg9 cta tce-tetfltc. c tg tcac 9 ca cctgattgca tg tccaaac t aatcaaaact 27840 

ctct gtgctt taggtgatat ttttaaggtg taaatcaata ataaacttac ctfcaaatttg 279qo 

acaacaaa tt tc tggtg aca tca ttC a g ca g caccact tt accctcttcc ca gct c t c gt 27960 

at gggatgcg a tagtgggtg gcaaacttcc tccaaaccct aaaagaaata t , ggtatcca 28020 

etteettgte ctcacccaca .ttttctct tttcatagat gaaaagaacc agagttgatg 28080 

aa g act tcaa cccc gtct ac ccctatgaca ccacaaccac tcc t gca gtt ccctttatat 28140 

cacccccctt tgt aaaca g c g a tggtcttc aggaaaaccc cccaggtgtfc ttaagfcctgc 282oq 

gaatagctaa acccctatat ttc 9 aca tgg agagaaaact agccctttca c ttggaag a g 28260 

9 gtt ga caa t taccgccgcc g9 acaa ttag aaagtacgca gagcgtacaa accaacccac ^ 

cgttgataat taccaacaac aacacactga ccc t ac g tca ttctcccccc ttaaacctaa 28380 

ct 9 acaata g Cttagtgcta ggctactcga gtcctctccg cgtcacagac aacaaactta 28440 

catt t aac tt cacatcacca ctccgtt.tg aaaatgaaaa ccttActttt aacCatacag 28SQQ 

a g cc tcttaa ac Ctat aaa t aaca gcCttg cca ttg acat caattcctca aaagg CCtt a 28560 

gtagcgtcgg a ggct cac t a gctgtaaacc tgagttcaga cttaaagttt gacagcaacg 28620 

3 atcca t a 9 c ttttggct. caaaccc tgt ggaccgctcc gacctcgact ggcaactgca 28680 

cc 9tctac a g c 3agggcgat tccctactta gtctctgttt aaccaaatgc ggagctcacg 28?40 

tc tt a 9g aa g t g taa gttta acc ggttt aa caggaaccat aacccaaatg actgatattt 28800 

ctgtcaccat tcaa tttaca tttgacaaca atggtaagct actaagctct ccacttataa 28860 

acaacgcctt tagtattcga cagaatgaca gtacggcctc aaaccctacc tacaac.ccc 28920 

t 39 c gtCt a t gcctaacagt accata Catg caagaggggg agg t gg C g aa ccacgaaaca 28980 

actactacgt ccaaacgtat ct t agg ggaa atgttcaaaa accaatcatt cttactgtaa 29040 

cctacaactc agtcgccaca ggatattcct tatcttttaa gtggactgct cttgcacgtg 29X00 

aaaagtttgc aaccccaaca acctcgtttt gctacattac agaacaataa aaccgtgtac 29X60 

cccaccgttt cgtttttttc a gat9 aaac g gg cgagagtt gatgaagact tcaacccagt 29220 

gtacccttat gaccccccac atgctcctgt Catgcccttc attactccac cttttacctc 29280 

ctcggatggg ttgcaggaaa aacca Cttgg a gtg ttaa gt ttaaactaca gagatcccat 29340 



82 



tactac.caa aa tgagtct c tt acaa tt aa ac tagg aaac gg cctcac t c ta.acaacca 2940 0 

gg3 acaacta aca t caacc 9 ct 99 c 9 aa gt a.aacctcca ctcactaac. ctaacaacaa 294 60 

ac ttg cac tg gtct a t a gcg a t cct tt a gc a gt aaa gcg c aaca 9 cc t aa ccttatc.ca 2952 0 

cacc gCt ccc ct tgttattg ccgataactc tttagcattg caagtttcag agcctatctt 295flo 

tataaatgac aa gg acaaac ta 9 ccc tg ca aaca 9 cc 9 c g ccccttgtaa ctaac 9 ct gg 29640 

caccctccgc ttacaaagcg ccgccccttt a ggc at tg ca gaccaaaccc taaaactcct 297 00 

g tttaccaac cc tttgtact tgC a g aataa c ttt c t cac g tt a 9 ccatt 9 aac 9 acccc t 29760 

t 9 cca tt acc aa tactgg aa a 9 ct gg ctct aca.ctctcc ccacc g ctac aaaca 9 ca 9 a 29820 

caca 9gcttg ac ttt9 caaa ccaac gtg cc attaact gta a g caac ggga ccc t a 9gCtt 298e0 

agcca t aaa g cgcccactta tta ttC a gg a caacaact tg ttt ttgg ac t t ca g a gctC c 29940 

cc t9Cgt c tt tt caaca 9 c g acccagtact a gggctt aac Ctttacaccc ctct tg c 9gt 30000 

ac 9 c g a tg a g gcg c t cac tg tt aacaca gg cc gcgg cc t c aca gtgagtt ac 9 a tggttt 30 060 

aattttaaat c ttg9taa9g atcttcgctt t9acaacaac accgtttctg tcgctcfcCag 3^ 

t gc t9 ctt tg .cc ttt acaat acac tg a t ca 9 cttc 9 cc tt aac gtgggcg ctgggct9C3 30180 

ttacaatcca gtg a gtaag * aat tgg acgt gaaccccaat caaaacaa 9g gttta acc tg 30 240 

gsaaaatgac tacctcatt. taaa gct a gg aaat ggatt a ggttttgatg gcgatggaaa 30 300 

CaU9Ct9tt tCtCCtCaa9 ttaCat ^ CC tgg accac tg cc 9 accca t c 30360 

ccccaa ttgt tccatctaca ^ 

a 39gggtgtg gttcacggtt ctgttgcttt aaaagcattg aaaggaaccc tattgagtcc 30480 

tac ggaa a gc gc ca Ctg tta ttatactaca ttttgacaat tatgga9tgc gaattctcaa 30540 

ttatcccact ttgg3 cactc aa gg cac gCt ggg aaataat 9 caac ttggg g ttata ggca 30600 

ggg a g aatct g ca g acacta a tgt ac t caa tg cac tag ca tttatgccca gttcaaaaag 3 0660 

9t acccaa 9a 99gcgtggaa gcgaagttca 9 aa t caaac t gtgg9 ctaca c ttgtataca 30720 

gggtg ac ttt tct at g ccc g t acc gt acca aataca.tac aactatggac eaactggcta 3078( , 

ctcc tttaaa tttatttgga gaactgtttc catgcfcgttt 3o84o 

tttctctt.c attac gg aa 3 aataaa acaa ctttttcttt ttattttctt tttatttt.c 30.00 

ac 9Ca ca gt a aggcttcctc cacccttcca tctcacagca tacacca 9 cc tctccccctt 30 9 60 

ca t99 ca gt a aac tgCtgtg a 9 tca gt cc g gtatttggga gttaagatcc aaacagtctc 31020 
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ttt**.* aaaca tg9at cc gt9atgga cacaaatccc tgggaca9gC tctccaacgt ^ 
ttc 9gt aaaa aact 9 ca tgc c.ccctacaa aaca a aca gg tt ca 9gctct ccacgggtta JW| 
tctcccc 9 a t caaac t ca ga ca gagt a aag gtgcgatgat gttccactaa ^ 
t S 3 = 9 c tgt c t9 aacc t c t c 9gtgcgactc ctgtgaggct ^ 
a 9 ca 9 cc t ca ca 9 ca tgg a t ca t ca gtct a c 9agtgcgtc tggcgcagca gcgcatctga 
a t cccac tg a 9 at t cc 99 ca a 9 aa tcg cac accatcacaa tC a ggttgtt catgatccca 31380 . 
ta 3 ct 9aa ca c 9 c t cca 9 cc aaa 9 ctca tt c 9 ctccaaca gcg ccacc g c gtgtccgtcc 31440 
aaccttactt t aacataaat ca ggtgtctg ccg c gt acaa aca tg ctacc c.cataca.a 3X500 

acc t ccc ggg gcaaacccct gttcaccacc ^^^^ ^^^^ 3i5so 

a 9gg a 9 cca t a 9 a tagC ca t tttaaaccaa ttagctaaca ^ 

t 9 aa 9agaac cgggagagtt acaatgacag cgaataatcc atctctcata ^^^^^ ^ 

9tc tgatgga aatccagatc taacgtggca cagcagatac acaccttcafc atacafctttc ^ 

atcaca tgtt tttcccaggc cgtCaaflata ^ 

acaataaa 9c taatacaaga tggtatactc ctcacctcac ^ 

t tt tcaca tt ct a, gt acc 3 a g a gttct cc tCt acaaca g cact 9 cc 9 c 9 . gtcctcacaa 3 1920 

99t9gt a gct ggtgacgatt gtaaggagcc gataccgtct gtcgcgttgc . 3i9eo 

atc 3tagaC c agggaccgac gcacttcctc gtacttgtag ^^^^ acgtccgctg 32Q4o 

cca gcacgtc tccaagtaac gccggtcccfc gcgtcgctca cgcfcccctcc 

9t 9 caaccac tCttgCaatc cacacagatc cctctcggcc tccggggcga tgcacacctc 32i6Q 

aaacc taC a g a tgtct c ggt acagCtccaa acacgfcagtg agggcgagtt ccaaccaaga ^ 

ca g aca gcct g atc tat ccc 9 acaca Ctgg aggtggagga agacacggaa gaggcatgtt ^ 

at t cca agcg att caccaac gggtcgaaat gaagatcccg ^ 

c 9g a gC cc tg at gg aa ttta aca 9 cca 9 at caaaca ttat gcgattttcc aggctatcaa ^ 
t c g c ggcCt9 caaaa^cc ^accc.ca cttccacaaa cacca 9 caaa 9 caaaa 9 c gt 

tattatcaaa c tctt c 9 a t c a t caa gctg c a ggactgtac aat 9 cccaa g taattttcat 32520 

ttctccactc 9 c 9 aa Cg a tg tcg c 9g caaa tagtctg aa g gttcatgccg tgcatattaa 32580 

aaa gctccga aagggcgccc tctatagcca tgcgtagaca ^ 

a t c ggg ctcc t 9 a 9 acacc t 9 ca 9 ca 9 at t t aaca 9 accc a ggt ca 9gtt gc tctcc g c g 32700 

a t c g c 9 a at c tccatcc g ca aa 9 tca tttg caaataatta aatagatctg cgccgactaa ^ 
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atctgttaac tccgcgctag gaactaaatc aggtgtggct acgcagcaca aaagttccag 32820 

ggat ggcgC c aaactcacta gaaccgctcc cgagtagcaa aactgatgaa tgggagtaac 32880 

acagtgtaaa atgttcagcc aaaaatcact aagctgctcc tttaaaaagt ccagtacttc 32940 

tatattcagt tcgtgcaagt actgaagcaa ctgtgcggga atatgcacag caaaaaaaat 33000 

agggcggctc agatacatgt tgacctaaaa taaaaagaat cattaaacta aagaagcccg 33060 

gcgaacggtg ggatatatga cacgctccag cagcaggcaa gcaaccggct gtccccggga 33120 

accgcggtaa aattcatccg aatgattaaa aagaacaaca gagacttccc accatgtact 33180 

cggttggatc tcctgagcac agagcaatac ccccctcaca ttcatatccg ctacagaaaa 33240 

aaaacgtccc agatacccag cgggaatatc caacgacagc tgcaaagaca gcaaaacaat 33300 

ccctctggga gcaatcacaa aatcctccgg tgaaaaaagc acatacatat tagaataacc 33360 

ctgttgctgg ggcaaaaagg cccgtcgtcc cagcaaatgc acataaatat gttcatcagc 33420 

cattgccccg tcttaccgcg taaacagcca cgaaaaaatc gagctaaaat ccacccaaca 33480 

gcctatagct atatatacac tccacccaat gacgctaata ccgcaccacc cacgaccaaa 33540 

gttcacccac acccacaaaa cccgcgaaaa cccagcgccg tcagcacttc cgcaatttca 33600 

gtctcacaac gtcacttccg cgcgcctttt cactttccca cacacgccct tcgcccgccc 33660 

gccctcgcgc caccccgcgt caccccacgt caccgcacgt caccccggcc ccgcctcgct 33720 

cctccccgct cattatcata ttggcacgtt tccagaataa ggtatattat tgatgcagca 33780 

aaacaatccc tctgggagca atcacaaaat cctccggtga aaaaagcaca tacatattag 33840 

aataaccctg ttgctggggc aaaaaggccc gtcgtcccag caaatgcaca taaatatgtt 33900 

catcagccat tgccccgtct taccgcgtaa acagccacga aaaaatcgag ctaaaatcca 33960 

cccaacagcc Catagctata tatacactcc acccaatgac gctaataccg caccacccac 34020 

gaccaaagtc cacccacacc cacaaaaccc gcgaaaatcc agcgccgtca gcacttccgc 34080 

aatttcagtc tcacaacgtc acttccgcgc gccttttcac tttcccacac acgcccttcg 34140 

cccgcccgcc ctcgcgccac cccgcgtcac cccacgtcac cgcacgtcac cccggccccg 34200 

cctcgctcct ccccgctcat tatcatattg gcacgtttcc agaataaggt atattattga 34260 
tgca 

34264 

<210> 5 
<211> 31044 
<212> DNA 
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<213> simian adenovirus SV-25 
<400> 5 

c«c atcaat aatatacett lttctgg ,„ c9tgccMca tgatMtw 
•MW cc^c 9t9COTt9ac 5cg999tsgc 

.-a**. 9^999.99 = 9CCM9ttt te , cgtat9c cctaMcc59 
atttg9ge9t , tactt9ta , 9t[ttgtgta ntt ^ ^ 

•ttMM, « 9a99tt „ taC9Cacttc ttat9actgg gcggaatttc 

C<,9C9a *°" <W« t c 9 c ta c gtg a c agtaC c a c 9a9M99 „ c 

"*~ " C " t9t ' C ■ c t9t c agat c 

*tc aa9agg c cactctt9a3 t3et99cgag aagagttttc 

•99C tg9 tcc c= 9a9at9ta C99t9Ctttc a9c9acgaga cggtgcgtaa 

cc9«9.a« oa9ac9c9ct 99aaatttcc aattcgcctg tgcetc ^ 

electee tt9t9tttt9 9 «„ acgct cageaacgtg teae>t>g3g ^ 
9c a99a9aC9 ca9ttt9Cta cgata<;tcct ggagcttttg 

c.« etetet tccaagagaa aactatcaM ^ 

"""^ 9 " 9C ""' «=«.«« »99«c. ac 99a9c , ac , g 

C9.c ag cc a c c t9 tc 9Cggg a9Cacat9cc 9gattacatg tcgatggcgc tgeggag9go 

a99a999ttt 9cat „ actc gcgggcg=ag cctccgeggc ^ 

99-99.99*9 9aacctgag9 9CC999ccg9 . accatccaac 

99a99.c ta9 9tg « gagga waccccgaa gag9ggacta gtgggg<;tag ^ 
«9act 9a9t ct9aacctcg aaaatttttg aacgagttga ctgcgagtct 
C9=9 t cc 99 a9acaatttt ctggtctgaa ttg9aggagg aattca9gag gggggajctg 
»cc tg c Cat acaa9tat99 9tctga , cag ttaaaaacto accggttgg> 3c=gtgggi3 
9.«tt 9 , aa cccccc^a c.ctcttoct...,^ t9cg9ccgga ^^^^ 
— tcc.cc 9 c a c t9tta . c ataaa9aa9 a9C9ttcatg ttataggcca tggagctctg 
9 t 9=. Mt9 c aaac c gt c 9a cco^c tttastt9ag gtatgoaa>a toegggoccc 
9999 C , aEa9 9 c ttaaat99 tgcaacattt cacaacgca> ggtteactgg tgaMgtttc 

"e^ccctc t gtttgcaaa ta .c a c.c a9 c , 9a c 9 ctcc a c 99 c 9Ceta cttttttaac 
« taataa c a c a t gCgtg9a gec9t9999c a999tgtctt ega9g9gccg ctgEtttcac 
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ggctgctgga 


aggcggtggt gggaagactt 


aaaagtgtaa 


catctgtaaa 


aaaatgcgtg 


1680 


tttgagcggt 


gtgtgttggc tttaactgtg 


gagggctgtg 


gacgcattag 


gaataatgcg 


1740 


gcgtctgaga 


atggatgttt tcttttgcta 


aaaggcacgg 


ctagtattaa 


gcataacatg 


1800 


atatgcggca 


gcggtctgta cccttcacag 


ctgttaactt 


gcgcggatgg 


aaactgtcag 


1860 


accttgcgca 


ccgtgcacat agcgtcccac 


cagcgccgcg 


cctggccaac 


attcgagcac 


1920 


aatatgctta 


tgcgttgtgc cgtccacttg 


ggccctaggc 


gaggcgtgtt 


tgtgccttac 


1980 


cagtgtaact 


ttagccatac caagatttta 


ctagaacctg 


ataccttctc 


tcgagtgtgt 


2040 


ttcaatgggg 


tgtttgacat gtcaatggaa 


ctgtttaaag 


tgataagata 


tgatgaatcc 


2100 


aagtctcgtt 


gtcgcccatg tgaatgcgga 


gctaatcatc 


tgaggttgta 


tcctgtaacc 


2160 


ctaaacgtta 


ccgaggagct gaggacggat 


caccacatgt 


tgtcctgcct 


gcgcaccgac 


2220 


tatgaatcca 


gcgacgagga gtgaggtgag 


gggcggagcc 


acaaagggta 


taaaggggcg 


2280 


tgaggggtgg 


gtgtgatgat tcaaaatgag 


cgggacgacg 


gacggcaacg 


cgtttgaggg 


2340 


tggagtgttc 


agcccttatc tgacatctcg 


tcttccttcc 


tgggcaggag 


tgcgtcagaa 


2400 


tgtagtgggc 


tccaccgtgg acggacgacc 


ggtcgcccct 


gcaaattccg 


ccaccctcac 


2460 


ctatgccacc 


gtgggatcat cgttggacac 


tgccgcggca 


gctgccgctt 


ctgctgccgc 


2520 


ttctactgct 


cgcggcatgg cggctgattt 


tggactgtat 


aaccaactgg 


ccactgcagc 


2580 


tgtggcgtct 


cggtctctgg ttcaagaaga 


tgccctgaat 


gtgatcctga 


ctcgcctgga 


2640 


gatcatgtca 


cgtcgcttgg acgaactggc 


tgcgcagata 


tcccaagcta 


accccgatac 


2700 


cacttcagaa 


tcctaaaata aagacaaaca 


aatatgttga 


aaagtaaaat 


ggctttattt 


2760 


gttttttttg 


gctcggtagg ctcgggtcca 


cctgtctcgg 


tcgttaagaa 


ctttgtgtat 


2820 


gttttccaaa 


acacggtaca gatgggcttg 


gatgttcaag 


tacatgggca 


tgaggccatc 


2880 


ttfc ggggtga 


agataggacc attgaagagc 


gtcatgctcc 


ggggtggtgt 


tgtaaattac 


2940 


ccagtcgtag 


cagggtttct gggcgtggaa 


ctggaagatg 


tcctttagga 


gtaggctgat 


3000 


ggccaagggc 


aggcccttag tgtaggtgtt 


tacaaagcgg 


ttaagctggg 


agggatgcat 


3060 


gcggggggag 


atgatatgca tcttggcttg 


gatcttgagg 


ttagctatgt 


taccacccag 


3120 


gtctctgcgg 


gggttcatgt tatgaaggac 


caccagcacg 


gtgtagccgg 


tgcatttggg 


3180 


gaacttgtca 


tgcagtttgg aggggaaggc 


gtggaagaat 


ttagagaccc 


ccttgtggcc 


3240 


ccctaggttt 


tccatgcact catccataat 


gatggcaatg 


ggacccctgg 


cggccgcttt 


3300 


ggcaaacacg 


ttttgggggt tggaaacatc 


atagttttgc 


tctagagtga 


gctcatcata 


3360 



87 



3420 
3480 
3540 



ggccatctta acaaagcggg gtaggagggt gcccgactgg gggatgatag ttccatctgg 
gcctggggcg tagttaccct cacagatctg catctcccag gccttaattt ccgagggggg 
tatea.tgtcc acctgggggg caataaagaa cacggtttct ggcgggggat tgatgagctg 
ggtggaaagc aagttacgca gcagttgaga tttgccacag ccggtggggc cgtagatgac 3600 
cccgatgacg ggttgcagct ggtagttgag agaggaacag ctgccgtcgg ggcgcaggag 3660 
gggggctacc tcattcatca tgcttctaac atgtttattt tcactcacta agttttgcaa 3720 
gagcctctcc ccacccaggg ataagagttc ttccaggctg ttgaagtgtt tcagcggttt 3780 
taggccgtcg gccatgggca tcttttcgag cgactgacga agcaagtaca gtcggtccca 3840 
gagctcggtg acgtgctcta tggaatctcg atccagcaga cttc Ct gg tt gcgggggttg 3900 
ggtcgacttt cgctgtaggg caccagccgg tgggcgtcca gggccgcgag ggttctgtcc 3960 
ttccagggtc Ccagcgtccg ggtgagggtg gtctcggtga cggtgaaggg atgagccccg 4020 
ggctgggcgc ttgcgagggt gcgcttcagg ctcatcctgc tggtgctgaa gcggacgtcg 
tctccctgtg agtcggccag atagcaacga agcatgaggt cgtagctgag ggactcggcc 
gcgtgtccct tggcgcgcag ctttcccttg gaaacgtgct gacatttggt gcagtgcaga 
cattggaggg cgtagagttt gggggccagg aagaccgact cgggcgagta ggcgtcggct 
ccgcactgag cgcagacggt ctcgcactcc actagccacg tgagctcggg tttagcggga 
tcaaaaacca agttgcctcc attttttttg atgcgtttct taccttgcgt ttccatgagt 
ttgtggcccg cttccgtgac aaaaaggctg tcggtgtctc cgtagacaga cttgaggggg 
cgatcctcca aaggtgtccc gaggtcttcc gcgtacagga actgggacca ctccgagacg 4500 
aaggctctgg tccaggctaa cacgaaggag gcaatctgcg aggggtatct gtcgttttca 4560 
atgagggggt ccaccttttc cagggtgtgc agacacaggt cgtcctcctc cgcgtccacg 4620 
aaggtgattg gcttgtaagt gtaggtcacg tgatctgcac cccccaaagg ggtataaaag 4680 
ggggcgtgcc caccctctcc gtcactttct tccgcatcgc tgtggaccag agccagctgt 4740 
tcgggtgagt aggccctctc aaaagccggc atgatctcgg cgctcaagtt gtcagtttct 4800 
acaaacgagg tgga ttt gat attcacgtgc cccgcggcga tgcttttgat ggtggagggg 4860 
tccatctgat cagaaaacac gatctttttg ttgtcaagtt tggtggcgaa agacccgtag 4920 
agggcgttgg aaagcaactt ggcgatggag cgcagggtct gatttttctc ccgatcggcc 4980 
ctctccttgg cggcgatgtt gagttgcacg tactcccggg ccgcgcaccg ccactcgggg 



4080 
4140 
4200 
4260 
4320 
4380 
4440 



5040 
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cgggacgctg 


cctttgcaga 


acagcatccg 


cggagttcag 


cgagttacca 


tcacggacgc 


14880 


ccgccgccgg 


acctgtccct 


acgtctacaa 


agccttggga 


atcgtggccc 


cgcgcgtcct 


14940 


gtcgagtcgc 


actttctaga 


tgtccatcct 


catctctccc 


agcaacaata 


ccggttgggg 


15000 


tctgggcgtg 


accaaaatgt 


acggaggcgc 


caaacgacgg 


tccccacaac 


atcccgtgcg 


15060 


agtgcgcggg 


cactttagag 


ccccatgggg 


gtcgcacacg 


cgcgggcgca 


ccggccgaac 


15120 


caccgtcgac 


gacgtgatcg 


atagcgtggt 


ggccgacgcc 


cgcaactacc 


agcccgctcg 


15180 


atccacggtg 


gacgaagtca 


tcgacggcgt 


ggtggccgac 


gccagggcct 


acgcccgcag 


15240 


aaagtctcgt 


ctgcgccgcc 


gccgttcgct 


aaagcgcccc 


acggccgcca 


tgaaagccgc 


15300 


tcgctctctg 


ctgcgtcgcg 


cacgtatcgt 


gggtcgccgc 


gccgccagac 


gcgcagccgc 


15360 


caacgccgcc 


gccggccgag 


tgcgccgccg 


ggccgcccag 


caggccgccg 


ccgccatctc 


15420 


cagtctatcc 


gccccccgac 


9cgggaatgt 


gtactgggtc 


agggactcgg 


ccaccggcgt 


15480 


gcgagttccc 


gtgagaaccc 


gtcctcctcg 


tccctgaata 


aaaagttcta 


agcccaatcg 


15540 


gtgttccgtt 


gtgtgttcag 


ctcgtcatga 


ccaaacgcaa 


gtttaaagag 


gagctgctgc 


15600 


aagcgctggt 


ccccgaaatc 


tatgcgccgg 


cgccggacgt 


gaaaccgcgt 


cgcgtgaaac 


15660 


gcgtgaagaa 


gcaggaaaag 


ctagagacaa 


aagaggaggc 


ggtggcgttg 


ggagacgggg 


• 15720 


asgtggagtt 


tgtgcgctcg 


ttcgcgccgc 


gtcggcgagt 


gaattggaag 


gggcgcaagg 


15780 


tgcaacgggt 


gctgcgtccc 


ggcacggtgg 


tgtcttccac 


cccgggtgaa 


aaatccgcct 


15840 


ggaagggcat 


aaagcgcgtg 


tacgatgagg 


tgtacgggga 


cgaagacatt 


ctggagcagg 


15900 


cgctggatag 


aagcggggag 


tttgcttacg 


gcaagagggc 


gaggacgggc 


gagatcgcca 


15960 


tcccgctgga 


cacttccaac 


cccaccccca 


gtctgaaacc 


cgtgacgctg 


caacaggtgt 


16020 


tgccggtgag 


cgccccctcg 


cgacgcggca 


taaaacgcga 


gggcggcgag 


ctgcagccca 


16080 


ccatgcagct 


cctggttccc 


aagaggcaga 


aactagagga 


cgtactggac 


atgataaaaa 


16140 


tggagcccga 


cgtgcagccc 


gatattaaaa 


tccgtcccat 


caaagaagtg 


gcgccgggaa 


16200 


tgggcgtgca 


gaccgtggac 


atccagattc 


ccatgaccag 


cgccgcacag 


gcggtagagg 


16260 


ccatgcagac 


cgacgtgggg 


atgatgacgg 


acctgcccgc 


agctgctgcc 


gccgtggcca 


16320 


gcgccgcgac 


gcaaacggaa 


gccggcatgc 


agaccgaccc 


gtggacggag 


gcgcccgtgc 


16360 


agccggccag 


aagacgcgtc 


agacggacgt 


acggccccgt 


ttctggcata 


atgccggagt 


16440 


acgcgctgca 


tccttccatc 


atccccaccc 


ccggctaccg 


ggggcgcacc 


taccgtccgc 


16500 
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gacgcagcac cactcgccgc cgtcgccgca cggcacgagt cgccaccgcc agagtgagac 16560 

gcgtaacgac acgtcgcggc cgccgcttga ccctgcccgt ggtgcgctac catcccagca 16620 

ttctttaaaa aaccgctcct acgttgcaga tgggcaagct tacttgtcga ctccgtatgg 16680 

ccgtgcccgg ctaccgagga agatcccgcc gacgacggac tttgggaggc agcggtttgc 16740 

gccgccgtcg ggcggttcac cggcgcctca agggaggcat tctgccggcc ctgatcccca 16800 

taatcgccgc agccatcggg gccattcccg gaatcgccag cgtagcggtg caggctagcc 16860 

agcgccactg attttactaa ccctgtcggt cgcgccgtct ctttcggcag actcaacgcc 16920 

cagcatggaa gacatcaatt tctcctctct ggccccgcgg cacggcacgc ggccgtatat 16980 

ggggacgtgg agcgagatcg gcacgaacca gatgaacggg ggcgctttca attggagcgg 17040 

tgcgtggagc ggcttgaaaa atttcggttc cactctgaaa acttacggca accgggtgtg 17100 

gaactccagc acggggcaga tgctgaggga caagctaaag gacacgcagt ttcagcaaaa 17160 

ggtggtggac ggcatcgctt cgggcctcaa cggcgccgtc gacctggcca accaggccat 17220 

tcaaaaggaa attaacagcc gcctggagcc gcggccgcag gtggaggaga acctgccccc 17280 

tctggaggcg ctgcccccca agggagagaa gcgcccgcgg cccgacatgg aggagacgct 17340 

agttactaag agcgaggagc cgccatcata cgaggaggcg gtgggtagct cgcagctgcc 17400 

gtccctcacg ctgaagccca ccacctatcc catgaccaag cccatcgcct ccatggcgcg 17460 

ccccgtggga gtcgacccgc ccatcgacgc ggtggccact ttggacctgc cgcgccccga 17520 

acccggcaac cgcgtgcctc ccgtccccat cgctccgccg gtttctcgcc ccgccatccg 17580 

ccccgtcgcc gtggccactc cccgctatcc gagccgcaac gccaactggc agaccaccct 17640 

caacagtatt gtcggactgg gggtgaagtc tctgaagcgc cgtcgctgtt tttaaagcac 17700 

aatttattaa acgagtagcc ctgtcttaat ccatcgttgt atgtgtgcct atatcacgcg 17760 

ttcagagcct gaccgtccgt caagatggcc actccgtcga tgatgccgca gtggtcgtac 17820 

atgcacatcg ccgggcagga cgcctcggag tacctgagcc cgggtctggt gcagtttgcc 17880 

cgtgcgacgg aaacctactt ctcactgggc aacaagttca ggaaccccac cgtggcgccc 17940 

acccacgacg tcaccaccga tcggtcccag cgactgacaa tccgcttcgt ccccgtggac 18000 

aaggaagaca ccgcttactc ctacaaaacc cgcttcacgc tggccgtggg cgacaaccgg 18060 

gtgctagaca tggccagtac ctactttgac atccgcggcg tgatcgaccg cggacctagc 18120 

ttcaagcctt actccggcac ggcttacaac tcactggctc ccaaaggggc gcccaacaac 18180 

agccaatgga acgccacaga taacgggaac aagccagtgt gttttgctca ggcagctttt 18240 
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ataggtcaaa gcattacaaa agacgga 9tg caaataca.a ac tC a g aaaa tC aaca ggct l 83 00 

3 c tg cc 9 aca aaacttacca acca gag cc t caaat tg9 a 9 tttccacct9 g9ataccaac „,„ 

9 ttacca gt a ac 9 ct 9 cc 99 ac 9 a gtgtta aaagccacca ctcccatgct 9 cca tgttaC 1 8420 

99tt ca taCg ccaatcccac taatccaaac gggggt ca gg caaaaaca 9 a aggagacatt 18 480 

tc 9 ctaaact ttttcacaac aactgcggca 9 ca 9 acaata atcccaaagt 9gt tctttac 185 40 

agcgaagatg taaaccttca a 9 ccccc g a t actcacttag t atataa g cc aac 9gtggga X8600 

9 aaaac 9tt a tc 9 cc g ca ga a g ccc tgcta ac g ca g ca gg c gtgtcccaa cagagcaaac la660 

tacata ggCt tccgagataa ctttatcggt ttaatgtatt ataacagcac a ggg aacat g i 8720 

gg a gCtct gg cag 9tC aggc ctcgcagtta aaC gcag tt g tagacctgca agatcgaaac ^ 

acggaactgt cctatcagct aatgctagat g c tCtgggtg acagaactcg at . tttctc . 18Q40 

atgtggaatc a g9 cc gtgga cagctacgat ccagac gtta 9gattatcga gaaccatggg 18900 

9 tggaagac 9 a 9 c tg c 9 caa t t ac tgtttt ccactcccag gcatgggtat ttttaactcc le960 

tacaa 9gggg taaaaccaca aaatggcggt aatggtaact gggaagcaaa cggggaccta ^ 

tcaaatgcca atgagatcgc ttt aggaaac atttttgcca tggaaattaa cctccacgca l 9080 

aacctgtggc gcagcttctt gtacagcaat 9t9gc9ctgt acctgccaga cagctataaa 1 9 140 

ttcactcccg ctaacatcac tc t gcccgcc aaccaaaaca cctacgagta tatcaacgg, 1920 0 

c g c 9 tcac tt ctccaaccct 9gtg9ac acc t tt g ttaaca ttggagcccg atggtcgccg 19260 

9 a t ccca tgg acaacgtcaa ccccettaac c a tcacc gga ac 9 c ggg cc t cc g ttacc 9 c l 9320 

tccatgctgc t g99 aaa Cgg acgcgtggtg cctttccaca tacaagtgcc gcaaaaactt i 9380 

tt c 9 cg atta agaacctcct gct tttg ccc ggctcctaca c tt ac gagtg gagcttcaga 19440 

aaa gaCgtg a acatgattct gcagagcacc ct ggg caa tg atcttcgagt gg ac gggg cc 19500 

agcgtccgca ttgacagcgt caacttgtac gccaactut tccccatggc gcacaacacc 19560 

gcttctacct tggaagccat gc t gc 9 a aac gacaccaacg acca gtC g tt taacgactac l 962 o 

ctcagcgcgg ccaacatg.t ttatcccatt ccggccaacg ccaccaacgt tcccatttcc l 9680 

attccctccc gcaactgggc ggccttccgg 9g at gg a gct tcacccgcct taaagccaag l 9 740 

gaaacgcctt ccttgggctc cggctttgac ccctactttg tg tactca gg caccattcct l 9800 

tacc t99 ac 9 gcagctttta cctcaaccac actttcaaac gtctgtccat catgttcgat l 9860 

tcttccgtaa gttggccggg caacgaccgc ctcctgacgc cgaacgagtt cgaaattaag l 9920 



115 



c 9 catt gt39 ac ggggaag9 ctacaacgtg gctcaaagta acatgaccaa a g ac tggttt 19980 

ttaattcaaa tgctcagcc. ctacaacatc ggctaccaag gcttctatgt tccc gaggge 20040 

tacaaggatc gg at g tattc tttcttccga aacttccagc ccatgagccg ccaggtgccg 20100 

gatcccaccg ctgccggcta t caagcc gtt cccctgccca gacaacacaa caactcgggc 20160 

tttgtggggt acatgggccc gaccatgcgc gaaggacagc catacccggc caactacccc 20220 

tatcccctga tc gg cgc t ac cgccgtcccc gccattaccc agaaaaagtt tttgCgcgac 202eo 

cgcgtcatgt ggcgcatacc tttttccagc aactttatgt caa t gg 9 g gc cctgaccgac 20340 

ctcggacaga aca tgcttta cgct.actcc gcccatgccc tggatatgac tt ttgaggtg 20400 

g accccatga ac g a 9 cccac gttgctgtac atgctttttg a ggt gtt c g a c 9tg9tcaga 20460 

gtgcacc.gc c gca cc 9Cg g tattatcgag gccgtgtacc tgcgcacccc cttctctgcg 20520 

gg-atgcca ccacataagc cgctgaacta gctggttttt accccag.tc ccatgggctc 20580 

cacggaagac gaactgcggg ccattgtgcg agacctgggc tgcggaccct ac ttcct gg 9 20640 

cacctttgac aagcggtctc ccgggttcgt gtctcctcgc aaactcgcgt gcgcgatcgt 20700 

ga a tacc g cc g 9 ccg aga g a ccg gaggaga gcattggcta gctctgggct 9g aaccccc g 20760 

ctcgtccacg tttttcc^t tcgacccctt tggcttttca gaccaacgct . tgaagcagat 20B20 

ctatgcattt gaatatgagg gtctactcaa gcgaagcgcg ctggcctcct ccgccgatca 20880 

c t g tCt aacc ctggtaaaga gcactcagac ggttcagggc cctcacagcg ccgcctgtgg 20940 

cctt CCtt g C tgcatgtttt tgcacgcctt tgtgaactgg ccgg aca ccc ccatggaaaa 21000 

caaccccacc atggacctcc tgactggcgt tcccaactcc at g ctcca aa g cccc ag cgt 21060 

gc.gacc.cc ctcctccaaa accagaaaaa tctgtacgcc tttctgcaca agcactctcc 21120 

ctactttcgc c g cca t c ggg aacaaataga aaatgcaacc gcgtttaaca aaactctgta 21180 

acgtttaata aatgaacttt ttattgaact ggaaaacggg tttgtg attt ttaaaaatca 21240 

aagggg ttga gctgg.c.tc catgtgggag gccgg.aggg tggtgttctt gtactggtac 21300 

ttgggcagcc acttaaactc tggaatcac. aacttgggca gcggtatttc tgggaagttg 21360 

tcgtgccaca gctggcgggt cagctgaagt gcctgcagaa catcgggggc gg a g atctt g 21420 

aagtcgcagt ttatctggtt cacggcacgc gcg ttgcggt acatgggatt g g CaC actga 21480 

aacaccagca ggc t ggg att cttgatgcta gcca gggcca cggcgtcggt cacgtcaccg 21540 

gtgtcttcta tgtcggacag C g aa aaaggc gtgactttgc aaagctggcg tcccgcgcga 21600 

ggcacgcaat ctcccagg ta gtt g Ca c t c a cagcggatgg gcagaagaag atgcttgtgg 21660 
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ccgc g9gt ca t 9tagggat a ggC c 9 c tg cc ataaaa g c tt c gat c tg cc t g aaa g cc tgc 21720 

t tg9 cc tt9t 9Cccttc9gt ataaaaaaca ccgcaggacfc ^^^^^ ^^^^^ 2i7 ^ 

g c 3 caa g c gg catcgtgaaa 9 ca agcg c g t g c 9 tc ttcgt ttcgtaactg caccacgctg 21840 

c gg ccccacc 99ttct9 aa t cacctt gg cc cegecggggt t ttC ct tg a g a 9 c g c g c tgg 2190 0 

cc g9 cttc g c t9 cccacatc c atttccacg acatgctcct ^ 

t ggagg ca gc gcagctcctc g tca t c gt c g gCg ca gtgat gctcccacac g ac g ca 9 cca 220 20 

9tggg ctccc ac tt99gctt ggaggcctcg gcaatgccag aatacaggag aacgtagtgg 22080 

tgcagaaaac gtcccatcat ggtgC caaa g gttttctggc tgCtg aa ggt c at c 9ggc a g 22140 

tacctccagt cc t c gttaag cc aagtgttg cagatcttcc tgaagaccgt gcacCgatcg 222Q0 

93 ca t a aagt ggaactcatt gcgctcggtc ttgtcgatct tacacttttc catcagacta 2226o 

tgcataatct cca t9 ccctt tt ccc agg c 9 ca aa c aatct tggtgcCaca cgggctaggt 2232Q 

at 9g ccaaa g t ggttggcct ctg a ggcgg c gcttgttctt C c t c t t gagc cctctccc.a 223 eo 

ct 9 ac ggggg ttgaaagagg gtgccccttg gggaacggct tgaacacggt ctggcccgag 22440 

9C9tCCC98a gaatCt ^ Cat c 9ggggattg c t 33 cc 3 tca tggcgatgat ctg acccc gg 225 00 

" CteCtCCaCttC9tCCtC " C ^ actt t ttCgggg9 a cggt ac ggg a 22S6 0 

9t a 9ggggaa gagcgcggcg cgccttcttc ttgggcggca gttccggagc ctgctcttga 2262Q 

c 3 act 33 cca t tgt cttctc c t a g3 caa g a aaaacaagat ggaagactct ttctecteet 22680 

cc t c gt c aac gt ca 3 aaa gc 3 a gtctC cca ccttaagcgc cgagaactcc ca g c g cata g 22740 

aatcc gatgt 99g c tacgag actcccccc, c g a acttt tc g cc 9 cccccc ataaaC ac ta 22S 00 

ac 9ggtggac gg accacc tg g ccc taggag acgtactg c t gaagcacatc aggcggcaga 22860 

3 c gttat c gt 9 ca agatgct ct c a cc gagc ga c t c gcggt tccgctggaa gtggcggaac 2292Q 

ttagcgccgc cUcgagcga accctcttct ccccaaagac tccccccaa g a gg ca ggct a 2298 0 

ac gg cacct 9 cgagcctaac cctcgactca acctctaccc tgcctttgcc gtgccagagg 23 040 

tactggctac gt accaca tt tttttccaaa accacaaaat ccc t ctctc g t 9 cc g c g cca 2310 0 

aCC9CaCCM gtg ct gcga c tggaggaagg ggctcgcatacctgagattg 23 l 60 

c 3tgtCtgg a gg aa g tccca a aaatetttg aag9tctggg ccgcgacgaa aagcgagcag ^ 

caaacgctct ggaagagaac g ca gag a g tc acaaca gcg c ctt ggtagaa ct c gagggcg 23280 

acaacgocag act ggcc3 tc ctcaaacggt cc atagaagt cacgcacttc 3 cctacccc 3 23340 
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ccgttaacct 


ccctccaaaa 


gttatgacag 


cggtcatgga 


ctcgctgctc 


ataaagcgcg 


23400 


ctcagccctt 


agacccagag 


cacgaaaaca 


acagtgacga 


aggaaaaccg 


gtggtttctg 


23460 


atgaggagtt 


gagcaagtgg 


ctgtcctcca 


acgaccccgc 


cacgttggag 


gaacgaagaa 


23520 


aaaccatgat 


ggccgtggtg 


ctagttaccg 


tgcaattaga 


atgtctgcag 


aggttctttt 


23580 


cccacccaga 


gaccctgaga 


aaagtggagg 


aaacgctgca 


ctacacattt 


aggcacggct 


23640 


acgtgaagca 


agcctgcaag 


atttccaacg 


tagaacttag 


caacctcatc 


tcctacctgg 


23700 


ggatcttgca 


cgaaaaccgc 


ctcggacaaa 


acgtgctgca 


cagcacactg 


aaaggagaag 


23760 


cccgccgaga 


ctatgtgcga 


gactgcgtgt 


tcctagcgct 


agtgtacacc 


tggcagagcg 


23820 


gaatgggagt 


ctggcagcag 


tgcctggagg 


acgaaaacct 


caaagagctt 


gaaaagctgc 


238B0 


tggtgcgctc 


cagaagggca 


ctgtggacca 


gttttgacga 


gcgcaccgcc 


gcgcgagacc 


23940 


tagctgatat 


tatttttcct 


cccaagctgg 


tgcagactct 


ccgggaagga 


ctgccagatt 


24000 


ttatgagtca 


aagcatcttg 


caaaacttcc 


gctctttcat 


cttggaacgc 


tcgggaatct 


24060 


tgcccgccac 


tagctgcgcc 


ctacccacag 


attttgtgcc 


tctccactac 


cgcgaatgcc 


24120 


caccgccgct 


gtggccgtac 


acttacttgc 


ttaaactggc 


caactttcta 


atgttccact 


24180 


ctgacctggc 


agaagacgtt 


agcggcgagg 


ggctgctaga 


atgccactgc 


cgctgcaacc 


24240 


tgtgcacccc 


ccaccgctct 


ctagtatgca 


acactcccct 


gctcaatgag 


acccagatca 


24300 


tcggtacctt 


tgaaatccag 


ggaccctccg 


acgcggaaaa 


cggcaagcag 


gggtctgggc 


24360 


taaaactcac 


agccggactg 


tggacctccg 


cctacttgcg 


caaatttgta 


ccagaagact 


24420 


atcacgccca 


ccaaattaaa 


ttttacgaaa 


accaatcaaa 


accacccaaa 


agcgagttaa 


24480 


cggcttgcgt 


cattacgcag 


agcagcatag 


ttgggcagtt 


gcaagccatt 


aacaaagcgc 


24540 


ggcaagagtt 


tctcctaaaa 


aaaggaaaag 


gggtctactt 


ggacccccag 


accggcgagg 


24600 


aactcaacgg 


accctcctca 


gtcgcaggtt 


gtgtgcccca 


tgccgcccaa 


aaagaacacc 


24660 


tcgcagtgga 


acatgccaga 


gacggaggaa 


gaggagtgga 


gcagtgtgag 


caacagcgaa 


24720 


acggaggaag 


agccgtggcc 


cgaggggtgc 


aacggggaag 


aggacacgga 


gggacggcqa 


24780 


agtcttcgcc 


gaagaactct 


cgccgctgcc 


cccgaagtcc 


cagccggccg 


cctcggccca 


24840 


agatcccgca 


cacacccgta 


gatgggatag 


caagaccaaa 


aagccgggta 


agagaaacgc 


24900 


tcgcccccgc 


cagggctacc 


gctcgtggag 


aaagcacaaa 


aactgcatct 


tatcgtgctt 


24960 


gctccagtgc 


ggcggagacg 


tttcgttcac 


ccgtagatac 


ttgcttttta 


acaaaggggt 


25020 


ggccgtcccc 


cgtaacgtcc 


tccactacta 


ccgtcactct 


tacagctccg 


aagcggacgg 


25080 
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ctaagaaaac 3 ca 9 ca gttg ccggcgggag 3 act 3 c 3 tc t ca 3 c 3 ccc 3 a 3 aaccccca 3 251 40 
ccacca 333 a gct cc 3 aaac c g ca tatttc ccaccctcta c 3 c t atc ttt cag caaa 3 cc 252 00 
333g3 ca 3 ca 3 caa 9 aac t3 aaaataaaaa accgcacgct 3 a 33tC3 ctt accc 3 aa 3 ct 25260 
gcctctatca caa 9 a 9 c g aa gagcagctgc agcgaaccct 33 a 33 ac 3 ca 3 aa 3 cgct 3t 25320 
tccagaagta c tg c 3 = 3 acc accctaaata actaaaaaag ccc 3 c 3 c 3 c 3 3g ac ttC aaa 2 53B0 
cc 3 tct 3 ac 3 tcacca 3 cc 3 c 3 c 3 ccaaaa tgagcaaaga gattcccacg ccttacatgt 2S4 40 
33 a 3 ttacca 3 cc 3 ca 3 at 3 33 a tt a 3 cc 3 ccg 3 c 9 cc 3 c ccaggattac tccacgaaaa 255 00 
t 3 aact 33 ct cagcgccggg ccccacatga tttcccgcgt aaacgacatt cgcgcccacc 25 560 
gcaatcagct attgttagaa caggctgctc tgaccgccac gccccgtaat aacctgaacc 25620 
ctcccagctg gccagctgcc ct ggt3tacc agg a aacgcc tcca ccc a cc agcgtacttt 25680 
tgcccc 3t3 a cgccca 9gcg gaagtccaga tg ac taacg c gggcgcgcaa ttagcgggcg 25 740 
gatcccggtt tcggtacaga gttcacggcg ccgcaccc.a ta 3 ccca 33 t ataaa 3 a 3 gc 25 800 
tgatcattcg a gg ca gag g C gtccagctca acgacgagac agtgagctct tcgcttggtc 2 5860 
tacgaccaga c gg a 9 t g ttc cagctcgcgg g c t c gggccg ctcttcgttc acgcctcgcc 25920 
a gg catacc t g actc t gcag agctctg** ctcagcctcg ctcgggagga atcggacccc 2S9B0 
ttcagtttgt g 3 a 3gagttt gtgccctcgg tctactttca gcctttcfccc 93atC9cccg 
gccagtaccc ggacgagttc atccccaact tcgacgcgg t gagtgactct gtggacggtt 26100 
atgactgatg tcgagcccgc ttcagtgcta gtgga.caag cgc g9 ctcaa tcacctggtt 26160 
cgttgccgcc gccgctgctg c gtg3 ctcgc gacttgagct tagctctcaa gtttgtaaaa 2622 0 
aacccgtccg aa a cc ggg ag cgctgtgcac g ggtt9ga9c t ag tgggtcc tg a g aa 39 cc 262 80 
accatccacg ttctcagaaa c ttt g tggaa aaacccattt t ggttaaa c g a 3 a C caggg g 263 40 
ccttttgtaa tcagcttact ctgcacctgt aacca tgttg accttcacga ctattttatg 26 400 
Satcatttgt 3 c 3 ct 3 aa tt caataagtaa a 3 c 3 aa t tct taccaa 3 a tt atgatgtcca . 26 460 
t 3 ac t3 ttcc tc 3 ccactat acgatgttgt gccagtaaac tctcttgtcg acatctatct 265.0' 
gaactgttcc ttttggtccg cacagcttac ttggtactac gg t g aca cc g tcctttctgg 265 80 
ctcactgggc agctcacacg gaataacact tcacctcttt tcgccgtttc gatacggaaa 26640 
ctacagctgt cgtgccggta cctgcctcca cgttttcaat cttcagccct gtccaccgac 26700 
caaacttgta tttgtcgact ctaagcactt acagctcaac tgcagcattc taggccccag 26760 
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tatcttgtgg acatacaata 


aaatcaggtt 


ggtggaattt 


gtctactacc 


cacccagcgc 


26820 


ccgcggtttt ggggaaattc 


ctttccagat 


ctactacaac 


tatcttgcca 


cacattatgc 


26880 


aagtcaacag caaccaaact 


tgcaagcacc 


cttcacgcca 


ggagagtact 


cctgtcacgt 


26940 


aggctcctgc acagaaactt 


ttattctctt 


caacagatct 


tctgccattg 


aacgcttcac 


27000 


tactaactac tttagaaacc 


aagttgtgct 


tttcactgac 


gaaaccccta 


acgtcaccct 


27060 


ggactgtgca tgtttttctc 


atgacaccgt 


aacttggact 


cttaacaata 


ctctctggct 


27120 


cgcgttcgat aaccaaagct 


tgat.tgttaa 


aaattttgat 


ttaaccttta 


ctaaaccctc 


27180 


tcctcgcgaa atagttatct 


ttgctccttt 


taatccaaaa 


actaccttag 


cctgtcaggt 


27240 


tttgtttaag ccttgccaaa 


caaactttaa 


gtttgtttat 


ttgcctccgc 


aatctgtcaa 


27300 


actcatagaa aaatacaaca 


aagcgcccgt 


cttggctcct 


aaaaccttct 


accactggct 


27360 


aacctacacg gggctgtttg 


cactaattgt 


ttttttccta 


attaacattt 


ttatatgttt 


27420 


cttgccttcc tccttctttt 


cgcgaacacc 


gttgccgcag 


aaagacctct 


ccttattact 


27480 


gtagcgcttg ctatacaaaa 


ccaagagtgg 


tcaaccgtgc 


tctcaatcta 


ttttcaattt 


27540 


ttcattttgt ccttaatact 


ttctcttatt 


gtcgttaaca 


atgatctgga 


gcattggtct 


27600 


cgcctttttt tggctgctta 


gtgcaaaagc 


cactattttt 


cacaggtatg 


tggaagaagg 


27660 


aactagcacc ctctttacga 


tacctgaaac 


aattaaggcg 


gctgatgaag 


tttcttggta 


27720 


caaaggctcg ctctcagacg 


gcaaccactc 


attctcagga 


cagacccttt 


gcatccaaga 


27780 


aacttatttt aaatcagaac 


tacaatacag 


ctgcataaaa 


aactttttcc 


atctctacaa 


27840 


catctcaaaa ccctatgagg 


gtatttacaa 


tgccaaggtt 


tcagacaact 


ccagcacacg 


27900 


gaacttttac tttaatctga 


cagttattaa 


agcaatttcc 


attcctatct 


gtgagtttag 


27960 


ctcccagttt ctttctgaaa 


cctactgttt 


aattactata 


aactgcacta 


aaaatcgcct 


28020 


tcacaccacc ataatctaca 


atcacacaca 


atcaccttgg 


gttttaaacc 


taaaattttc 


28080 


tccacacatg ccttcgcaat 


ttctcacgca 


agttaccgtc 


tctaacataa 


gcaagcagtt 


28140 


tggcttttac tatcctttcc 


acgaactgtg 


cgaaataatt 


gaagccgaat 


atgaaccaga 


28200 


ctactttact tacattgcca 


ttggtgtaat 


cgttgtttgc 


ctttgctttg 


ttattqqqqq 


28260 


gtgtgtttat ttgtacattc 


agagaaaaat 


attgctctcg 


ctgtgctcct 


gcggttacaa 


28320 


agcagaagaa agaattaaaa 


tctctacact 


ttattaatgt 


tttccagaaa 


tggcaaaact 


28380 


aacgctccta cttttgcttc 


tcacgccggt 


gacgcttttt 


accatcactt 


tttctgccgc 


28440 


cgccacactc gaacctcaat 


gtttgccacc 


ggttgaagtc 


tactttgtct 


acgtgttgct 


28500 
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gtgctgcgtt agcgtttgca gtataacatg ttttaccttt gtttttcttc agtgcattga 28560 

ctacttctgg gtcagactct actaccgcag acacgcgcct cagtatcaaa atcaacaaat 28620 

tgccagacta ctcggtctgc catgattgtc ttgtatttta ccctgatttt ttttcacctt 28680 

acttgcgctt gtgattttca cttcactcaa ttttggaaaa cgcaatgctt cgacccgcgc 28740 

ctctccaacg actggatgat ggctcttgca attgccacgc ttggggcgtt tggacttttt 28800 

agtggttttg ctttgcatta caaatttaag actccatgga cacatggctt tctttcagat 28860 

tttccagtta cacctactcc gccgcctccc ccggccatcg acgtgcctca ggttccctca 28920 

ccttctccat ctgtctgcag ctactttcat ctgtaatggc cgacctagaa tttgacggag 28980 

tgcaatctga gcaaagggct atacacttcc aacgccagtc ggaccgcgaa cgcaaaaaca 29040 

gagagctgca aaccatacaa aacacccacc aatgtaaacg cgggatattt tgtattgtaa 29100 

aacaagctaa gctccactac gagcttctat ctggcaacga ccacgagctc caatacgtgg 29160 

tcgatcagca gcgtcaaacc tgtgtattct taattggagt ttcccccatt aaagttactc 29220 

aaaccaaggg tgaaaccaag ggaaccataa ggtgctcatg tcacctgtca gaatgccttt 29280 

acactctagt taaaacccta tgtggcttac atgattctat cccctttaat taaataaact 29340 

tactttaaat ctgcaatcac ttcttcgtcc ttgtttttgt cgccatccag cagcaccacc 2940.0 

ttcccctctt cccaactttc atagcatatt ttccgaaaag aggcgtactt tcgccacacc 29460 

ttaaagggaa cgtttacttc gctttcaagc tctcccacga ttttcattgc agatatgaaa 29520 

cgcgccaaag tggaagaagg atttaacccc gtttatccct atggatattc tactccgact 29580 

gacgtggctc ctccctttgt agcctctgac ggtcttcaag aaaacccacc tggggtcttg 29640 

tccctaaaaa tatccaaacc tttaactttt aatgcctcca aggctctaag cctggctatt 29700 

ggtccaggat taaaaattca agatggtaaa ctagtggggg agggacaagc aattcttgca 29760 

aacctgccgc ttcaaatcac caacaacaca atttcactac gttttgggaa cacacttgcc 29820 

ttgaatgaca ataatgaact ccaaaccaca ctaaaatctt catcgcccct taaaatcaca 29880 

gaccagactc tgtcccttaa cataggggac agccttgcaa ttaaagatga caaactagaa 2994 0 

agcgctcttc aagcgaccct cccactctcc attagcaaca acaccatcag cctcaacgtg 30000 

ggcaccggac tcaccataaa tggaaacgtt ttacaagctg ttcccttaaa tgctctaagt 30060 

cccctaacta tttccaacaa taacatcagc ctgcgctatg gcagttccct gacggtgctt 30120 

aacaatgaac tgcaaagcaa cctcacagtt cactcccctt taaaactcaa ctccaacaac 30180 
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tcaatttctc tcaacactct atctcc 3t tt a g aa tCg a 9 a .tggtttcct cacgctctat , Q24Q 

tt g33 aacaa aatc t3gctt 3ct a 3tt caa aaca 3tggct taaaagttca a gcgggc tac 30300 

39 cc t3 caa 3 ta aca 3 acac caatgctctc aca tt aa 3 at atctc.ctcc ac tgaccatt 30360 

cca g actc 3g 3 ctpa g aaca a g3 ca t tc tt aaa 3t aaaca ct 33 aca 3gg CCt aa gtgtg 30420 

aaccaa 3Ctg 3 a 3 c 3 ctt g a aacatcccta gg a ggtggat taaaatatgc tgataacaaa 30080 

ataacct ttg ataca 3g aaa c gg act 3 aca ttatctgaaa ataaacttgc a 3 ta 3CtgC a 30540 

9 9t a gtggtc C aac ttttag ag at ggCg cc tt33tagcca cgggaaccgc atttacgcaa ^ 

acac t3t39a ctac 33ct9 a t cc 3 tctccc aa Ct3 cacaa ttat aca 3ga cc 3 c 3 ac a c a 30**0 

a.atttactt t 33 c 3 c tt ac ca ttagtggg a 3 cc aagt3 c t ggggacgg t ttccattatt 307 20 

33 a 3 taa aa3 9 ccccc tttc aagtagcata ccgtcagcfca ccgctacagt ^ 

« t3 a tt cca ac 3gag cc=t att g a 9 c t cc tcttcactta aa 3gttactg ggggtatcgc 3 0840 

ca aggt ccc t caat tgaccc ttaccccata attaatgccfc ^ 

ct 3gcttat c cccc 333 aca a.aaatccaa 9 ca aa a tgt a aca tgt ac gt ttctactttt 30960 

ttac g a 3gaa atccacaaa, accaatagtt ttaaacatca cttttaata& tcaMccagc ^ 

3ggtt t t cca t ta 3 atttac a tgg ac aaat ,taaccaca 3 gagaagcatt tgcaatgccc 31080 

cc at9Cactt tttcctacat tgccgaacaa caataaacta . tgtaa , ccfcc accgttaacc jii4o 

c 3 cc t cc g cc cttccatttt at tttataaa ccacccgatc caccttttca ^ 

att 3 cat 3t c a 3 ta 3333ca gtaaaacttt tgggagttaa aatccacaca ggttcttcac ^ 

aagctaagcg aaaatcagtt acacttataa aacc at c g ct aa Ca tc 3gac aaa 3 acaa 3 c 3132 0 

at g a gt cc a a a g c t tcc ggt tctggatcag atttttgttc attaacagcg ggagaaacag 31380 

cttc tggagg attttccatc tccatctcct tcatcagttc caccatgtcc accgtggtca 3i44o 

tc tgggacga gaacgacagt tgtcatacac ctcataagcc accggtcgat gacgaacgta 31500 

cagatctcga a 3aat3tC c t 3tcg cc g cc t ttcgg c agca c tgg3CC3aa 33C3aaa3 c 3 3XS60 

cccatgttta acaatggcca 3 c,cc 3 ccc 3 cttcatcagg cgcctagttc ttttagcgca 3X620 

acagcgcatg cgcagctcgc taa 3 act ggc g caa gaaaca cag caca gaa c cac c agatt 31680 

3tt ca t9 atc cca t aa gcgC gctgacacca 9 ccca t ac t a acaaa ttgtt tcactattct 3X740 

a 3 cat 9 aa tg tcatatctga Cgttcaagta ^ 

tcccacgtac aacacctcct ttgg catc tg a taatt aacc acctcccgat accaaataca 3X860 

tctctgatta ata g tc g ccc c gt acac t ac ccgat taaac c aagttgcca acataatccc 31920 
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ccctgccata 


caccgcaaag 


aacctggacg gctacaatga 


cagtgcaaag 


tccacacctp 


1 1 QRn 


gttgccatgg 


ataactgagg 


aacgecttaa gtcaatagtg 


gcacaactaa 


tacaaarstri 




caaatagtgt 


ttcaacaagt 


gccactcgta tgaggtgagt 


atcatgtccc 


aQQQaacaaa 


3 91 Of) 


ccactccata 


aacactgcaa 


aaccaacaca tcctaccatc 


ccccgcacgg 


cactcacatr 

w »-* w w wavQ V- Wj 


. J410U 


gtgcatggtg 


ttcatatcac 


agtccggaag ctgaggacaa 


QQaaaaatct 

ZJ Zj Zj W W) W 






ttcatagggc 


ggtagtgggt 


actccttgta ggggttcagt 


Cqqcaccaqt 


atTtrrhrao 
c*Uwwww-wl"»dw- 




cttccgggcc 


ataacacaca 


agttgagatc tgatttcaag 


qtactttcta 

1 W %A w WWW w W WJ 


ad Ly adddCC 




aagtgctttc 


ccaacaatgt 


atccgatgtc ttcggtcccc 


nrn t" CClftt' a n 

y*-y wwyy Ldy 


cgccccttgc 


32400 


agtacacacg 


gaacaaccac 


tcacgcaggc ccagaagaca 


Qttttccaca 


y »^yy tgaca 


3 24 60 


agttaatccc 


cctcagtctc 


agagecaata tagtttcttc 


Cacao t a err a 


udyyecaaac 


32520 


ccaaccagga 


aacacaagct 


ggcacgtccc gttcaacggg 


aqqacaaqaa 


a 3 ua y d yy c » 


J<£30U 


gaggcatagg 


caaagcaaca 


gaatttttat tccaactggt 


cacqtaqcac 




32o40 


aggtcacgta 


aatggcagcg 


atcttgggtt tcctgatgga 


acataacagc 


ddyd ccaaac 


32700 


atgagacgat 


tctcaaggtg 


attaaccaca gctggaatta 


aatcctccar 


yegcaca tec 


32760 


agaaacacca 


gcaatacaaa 


agcccggttt tetcegggat 


Ctatcataor 

w w w w u lmm ^ 


agcacag tea 


32620 


tcaattagtc 


ccaagtaatt 


ttcccgtttc caatctgtta 


taatttgcag 


a a haahnrr'p 
oaLaaLytvC 


329 00 


tgtaaatcca 


age egg c cat 


ggegaaaage tcagataatg 


cactttccac 


otacat t* <**at- 




aaacacaccc 


tcatcttgtc 


aatccaaaaa gtcttcttct 


tgagaaacct 


OfaohaaaHh 
y uoy laadit 


33000 


aagaatcgcc 


aggttaggct 


cgatgcctac atcccggagc 


t teat tctca 


y wd LytaCty 


3 305 0 


caaatgatcc 


agcagatcag 


aacagcaatt agcagccagc 


tcatccccoa 

W V» W> W> W»V|^f 


LLLL Cay LUC 


3 3120 


cggagttccc 


aeggcaatta 


tcactcgaaa cgtgggacaa 


atcgaaataa 


^exuy ayv, LL(». 


3 3 JLoO 


cacgtgagca 


aaagcegtag 


ggccagtgca ataatcacag 


aaccaocaaa 


actaaaydLUy 


33240 


cagctcatgt 


ttcaaaaagc 


tetgeagate aaaattcagc 


-wv^a wy w-ddd w 


aacacagtaa 


33300 


agtttgcggt 


atagtaaccg 


aaaaccacac oaqtcqacat 




cggcttacct 


33360 


aaaaaagaag 


cacattttta 


aaccacagtc gcttcctgaa 


caggaggaaa 


tatggtgcgg 


33420 


cgtaaaacca 


gacgcgccac 


cggatctccg gcagagccct 


gataatacag 


ccagctgtgg 


33480 


ttaaacagca 


aaacctttaa 


ttcggcaacg gttgaggtct 


ccacataatc 


agcgcccaca 


33540 


aaaatcccat 


ctcgaacttg 


etcgegtagg gagctaaaat 


ggecagtata 


gccccatggc 


33600 
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accc 3aaC9 c t aa tctg eaa 3tat a t3 a 3 a 3 ccacccca t tcggcgggat cacaaaatca ^ 

9tc 33 a 3 aaa acaacgtata cacccc 33 ac t 3 caaaa gct 3tt ca 39 caa. ac g cccc tgc 33720 

ggtccctctc 93t acacca g caaa g cc tCg 33t aaa 3 ca g ccatgccaag c 3 ct t acc gt 33780 

9ccaa gag c g actcagacga aaaa gtgta(c tgaggcgctc agagcagcgg ctatatactc 3384(} 

tacctgtgac gtcaagaacc gaaagtcaaa a gtt caccc 3 gcg c g ccc 3 a aaaaacccgc 33900 

gaaaatccac ccaaaaagcc cgcgaaaaac acttccgtat aaaattcccg ggtt acc ggc 33960 

gc 9 tcacc g c cgcgcgacac gcccgccccg ccccgcgctc ctccccgaaa cccgccgcgc 34020 

ccacttccgc gttcccaaga caaa 9g tc g c gt aactcc g c ccacctcatt t 9 cat gt taa 34080 
ctcggtcgcc atctt gcggt g ttatatt g a t 9 at g 



<210> 7 

<211> 44 

<212> DNA 

<213> Artificial 

<220> 
. <223> primer PSL 

<400> 7 

gcgcacgcgt ctctatcgat gaattccatt ggtgatggao atgc 

<2X0> 8 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> primer P5ITR 
<400> 8 

gcgcatttaa atcatcatca ataatatacc tcaaac 

<210> 9 

<211> 31 

<212> DNA * 

<213> Artificial \ 

<220> 

<223> primer P5XTOP 
<400> 9 

gatacctagg aacgaggagg atttgatatt g 

<210> 10 
<211> 20 



44 



36 



31 
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<212> DNA 

<213> Artificial 

<220> 

<223> primer P5XBOT 

<400> 10 

atgtacgcct ccgcgctcac 

<210> 11 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> primer P5E4 

<400> 11 

gatcgaattc ccactctgta ccccatctct g 

<210> 12 

<211> 31967 

<212> DNA 

<213> Simian adenovirus 



<220> 

<221> CDS 

<222> (13796) . . (15322) 
<220> 

<221> CDS 

<222> (18257) . . (21010) 
<220> 

<221> CDS 

<222> (27192) (29015) 
<400> 12 

catcatcata atatacctta tttgggaacg gtgccaatat gataatgagg aggcggggtt 
aggggtggag tgagggtggg gtgcggatga cgcgggcgcg gggcggggtg ggagtctgac 
gtggggcgcg gggtggagcg cgagggtgag ggcggggcga gggcggcggg cgcggcggaa 
ttgacgtaca cggtagtaag tttgagcgga aattaagtga attgggcgtg ttttttgtaa 
ctttttgacg tacacggtag taagtttgag cggaaattaa gtgaattggg cgtgtttttt 
gtaacttttt gacgtacacg gtagtaagtt tgagcggaaa ttaagtgaat tgggcgtgtt 
ttttgtaact ttttgacgta cacggtagta agtttgagcg gaaattaagt gaattgggcg 
tgttttttgt aactttttga cgtacacggt agtaagtttg agcggaaatt aagtgaattg 
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540 
600 
660 
720 
780 
840 
900 
960 
1020 



ggcgtgtttt ttgtaacttt ttgacgtaca cggtagtaag tttgagcgga aattaagtga 
attgggcgtg ttttttgt- ctttttgacg tacacggtag t aag tttgag c 9 g aaa ttaa 
g t g aa ttggg cgtgtttttt gtaacttttt ggtcattttg gcg Cgaaaac tgagtaatga 
ggaagcgaga cggactctgc ccttttttac gg tt ggg agg gaaaacCgct gatcagcgcc 
gaaCtttgg g c t ct ga c gc g gtggtttccc tacgtgg Cag tgccacgaga aggctcaaag 
tcctcgtttt attgtgtgct cagccttttt ga 3 gg tat t t aaacaccgtc agaccgtcaa 
gaggccactc tt g a gtgcga gcgagtagag ttttctcctc cgtcgctgcc gcggctgctc 
agtcttaccg ccaggatgcg aatgctgccg gagatcttca ccgggtcctg ggaagatgtt 
ttcc a9ggac ttttagaatc t gaa gacaac ttcccccaac ctcct gag cc ggaggagcta 
cct gaggttt cgcttcacga tct gtttgac g tggaggtgg agagccccga cggagatccg loao 
aac 9 a g9 aa g ct g tt 9 a tgg tatgttcccc gactggat9a tatctcagag cgagagtgct 
gaaggcagtg cggactcggg c gtttCtggg g ttggaaaC c tggtggaggt ggatctggac 1200 
ttgaagtgtt acgaggaagg ttttcctcct agcgactcag agactgatga agcctcag aa 1260 
gcggaaggtc aagaggagtc tgtgtgtggt tatgtgaaga ctaatgaggg ggagaaccfcg ^ 
ctggtgttgg actgtccgga ccaacctgga catggctgtc gagcctgtga c tttC acc 9 g 
999a cc agcg gaaac cc gga ag ct a tc tgt g c ttCgtg ct ac a t g c 9 tct g.acgagcac 
tgc at ataca gtgag C g tta ttcatgggtt atttatgggg aaag ttgggg gaaagtcctg 1500 
a 9 aa ggggaa aagtttaaca tgtcattttt gtacttgata ggtccagttt cggacgctga 156Q 
gggggattct gagtcccctg ctggtccttc ccagccctca ccctgctctt tgaccgccac 
gcccgcac Ct gaccta gtta gaccaacgcc c t gcc g a gtg tcctgtagac gacgtgcagc 
tgttaattgc atag aagatt tattggcccc tgatgacgag aacgcacctt tgaacctgfcg 
cctg aa acgc cc t a agacat cttgagtgtt ta t gc t g tta ataaaagtgt tgacccttag 
atcctgtgtt tattccttgg gcgtgtgcgc gggtatataa agcagctgcg ggctgg agtg 1860 
t ta g tttatt .c t g at ggag t actggagtga gctgcagaat taccagagcc tccggcgcct 1920 
gctggagttg gcctctgcca gaacatccac ctgc t gg agg ttctgttttg gctcgactct . 1980 
cagtaacgtg gtgtatcggg tgaagcaaga gtacagctcg cgcttttctg agctgttggc 2040 
ccgctacccg gctgtttttg tttctc tgga tctaggccat cacgtttatt tcca a g aagc 210 0 
tgtagtcaga tatttggatt tttctactcc cgggcgtgcg gtttctgcga ttgccttcat 2160 
ctgctttgtg ctagatcgat ggagcgccca aacccgcctg agcccggggt acaccctgga 2220 



1380 
1440 



1620 
1680 
1740 
1800 



126 



c t acc t9 acc a tgtccctgt 9gagggccat gctgcggaag ^ 
9cc 99Cg c 3g cc t cc gC ac 9 g ac tggat cc ggtgctggag gagtcggagc tggagga9ga 
gg a 9 aaccc 9 a 999 cc 99 cc tg3 accc C cc 99 c 9ga a t a g tg ac 9g aacc ggaggatccc 
caa gagggt a c t a gtC a ggg 9g9 a gggggg ccgaagagaa agcgggafcga agaggaggcg 

atggaccccg aca ggttCct aaaagaactg actttaagct t aat gtct aa gagaagaccc 
sa g ac 99 t 9t 99tggtctga tttggagaag gagttccacc gaatctgtfcg 

tacaa gtatg ggtttgagca ggtgaagact cactggctgg 

atggctttta acat gtttg c caa 9gt9gcg ct9 c g ccc 9g acactattta caccgtgact 
aa g ac 9gtgg aaatccgcaa gC c tgtgtat gtgattggca acggggccgt 
ca 9 accacc 9 acc 993tggc fctctaattgc tgtatgcaga 
aatcttaat, 9agtgacctt ttgcaatgfcc agattcg<;gg 

3tgttC9 cc 9 ccaccaccca 9 a t aaccc ta cac 9999tgt tcttccagca tgCaggcggg 

9Ct tgtgt a g atacc tgggc gaggg cc t c t gtgaggggct g caccttt gt gggctgttgg 
aaa 9 c 99tgg tgggtcgacc caagagtgtg 

t g tc t9 at 9g cca tggt9gt ggagggccag ggtaggatcc gccataacgc gggc . ccgag 
aatacc tgtt ttgccctgct gaagggtacg gcgaccgtga gatctgcggg 
9t 9ggt cac t c g ca gctgct g acc tg t gcg gatg9 caac t 9 cca 99 ccc t gcg cac ggtg 
ca Cgtggtgt cccacc 99 c g cc g cccct 99 cc 9gtgtttg aacataacat 9 c tg a tgcgc 
t 3t acca t9 c acc tgggct a cc g cc g c ggc gtgtttgtgc cccatcagtg taacctgacc 
cacaccaa gg tgttgcCgga gacggatgct ctttcgcgag tgaatctgaa 

.atctgacta tgg aga tgt a caa 3 ata gtg a g a tttg at g aatcaaagac cc 3 tt 3tcg c 
ccc tgcg a 9t gcg3t3 ccaa C cacc t3 a 33 a tgC a t ccc 3 t3 accc t gaa c gt gac 33 a g 
9 a 9 o t9 c 9 cc.c 99 accacca,a t3ctgtcc tgtctgcgca ccgatta<=ga aagcagcgafc 
Saggattaa. a ggtg a gggg cggggcctgc atggggtafca , aggtggggg aggaggtggg 
3 a 333g3 aaa acccaaaatg a gcgg atc g a t gga a 99g a g c g c tgtgagt tttg a gggcg 
9ggtgt tca g ccca t a tctg acaaccc gtc tccccgcctg ggcaggagtg cgtcagaatg 
t ggtggg ctc caac 3t99 ac 3g ac g tcc 9g t 99cccctgc caactccgcc aotctcaoct 
acgccaccgt c gg a t c gtcg ct gg acacc 3 ccgctgccgc c g cc 3 c tt ca g cc 3 cc 3 c tt 



2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 



127 



ctactgctcg cgg tatggca gctgattt gacCgtatca 

cc g tatca gcaaetcjgct g cgcctcgct 

— 1 * 9 ~ — - — tgga9 gagct9CCCC 

.^« 9t « ^ ^ cgtccccgct 

. 9 «aco 3 , 3 tgt9ttts9a aat „, aatg e3KttMit ^ 
— ». 9 ccc ttgaco . cct9tcgcg gtC9tt . a99 .„ ectgtg9i 
— «, aggtggg « t t ^ tugm gtacatgggc atgag _ 9t 

""""■'"^ 9999gCggtg tC9tagataa tccagtcgta 

ago99aagat gt „ etg . ga agca3gctga tg9c 

9t9ta99t9t tcacaaagcg gttgjgctg9 

~ g c aC c Ct99 .« gaatctegag gttggcgacg ttgco9ccc> 
gMg «c e9 « gtgca9g . ccaccagg>c ggtgtagccg gegMcttgg ^ 
«S».« t9 gaa9gsa , 99 cgtggaagaa cttggagMc cccKgtggc 
«cc, tgo „ « 9tccatga tgatggcgat ggg> _ tg ^ 

tg9g . ga c 9t cgtastcccg ctccagggtg agctcgtcgt ^ 
g9g > 9 c a99g t9c « 9actg geaccttcgg ^ 

3t . gtt9 cco t c gcagactt 9catct „„ ggcctcgatc ^ 
-c t9999c g o 9 . C9aaga a9acggt „ c as9ggcgggg ttgwgsgct . 
— «*» . g c ag c t9cg ,« t9cc9ca „ cggtgggc ccgtagatga cccc9> 

~ 9C t99ta9ttta aggagctgca gctgccgt . 0 tcgcgc ^ 

« 9 c tt « ga Wt9gtt tt „ ctgacg aggtcctgci 

™ 9g ga9a9aa9cc „ eccaggcc gcggaMtgc ttgaggggct 

atcttttcca gagcaggtac aggc ^ ^ 
~««~ . 099catotc gacccagcag acttcttggt tgcgggg9tt 

s*.^ gt9C9e9tcc a9g9aggC9a 999t9acgtc tteccagg 
c 9 ~ g c 9£te 9c9t9a99ge 99tctcggtg acggtgaagg gatgcgMM 

. tc «« t9ag „ tcatcctg ctggtgcgga <gcggg ^ c ^ 

9a9tC39 "* — — ^cc^c cgcgCg9ccc 

"^cca g « tgce « t 99a9ac9tgt ccgcag9cgg gaca9cgcag 

t999g9 co. 9 9 „ gaC99ac tcgggggagt aggcgtcggc gocgcactgs 
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gcgcacgtgg tctcgcactc gacgagccag gtg a g ctcc g ggtgttgggg atcaaaaacc 
agctggcccc cgtgtttttt gatgcgcttc t t acctcg gg tctccatgag gcggcgtccg 
gcttcggtga cg aaga gg C t gtcggtgtcg ccgtagacgg atttgagcgc gcgctgctcc 
aggggaatcc cgcgatcctc cgcgtgcagg aactcggacc actctgagac gaaggcccgg 
gtccacgcga ggacaaagga ggcgatctgg gacgg gtag c ggtcgttctc caccagggga 
tccaccttct cc agggtgtg caggcagagg tcgtcctcct ccgcgtccat gaaggtg att 
ggcttgtaag tgtatgtcac gtgaccgtcg gggtcgcgcg tgggcttata aaagggggcg 
tgcccggcct ccccgtcact ttcttccgca tcgctgtgga cgagatccag ct g ctc ggg t 61 20 
ga g ta ggcgc gctggaaggc gggcatgacc tc ggcgc t g a gggtgtC a g t ttccacgaac 
9a gg t gg a tt tgatattgac ctgtccggcg gc g a tg ct tt t g ac ggtgg c ggggtccatc 
tggtcagaaa agacgatctt ttt 9 tt gt cc a g ctt ggtgg cgaacgaccc gtagagggcg 
ttggagagca gcttggcgat ggagcgcagg gtctggttct tctcgcggte ggcgcgctcc 
ttggcggcga tgttgagctg gacgtactcg cgggccacgc agcgccattc ggggaagacg 
gtggcgcgct egtccggcag gagg c g c acg cgocagccgc ggttg t g c ag ggtgatgagg 
tccacgctgg tggccacctc gccgcgcagt ggctcgttgg tccagcagag gcgcccgccc 
ttgcgcgagc agaagggggg cagg ac g tc g agctggtcct ccgcgggggg gtcg9 cgtcg 
atggtgaag a tgcccggtag caggtggcgg tcgaagtagt cgatggcgac cgc ggggtcg 
gcgagggcgc gttcccagtc cctgaccgcc agggcgcgct cg t agggg tt gaggggcgcc 
ccccagggca tgggatgggt g a gggcc gag gcgta c at gc cgcagatgtc gtaga cg ta g 
aggggctcgc ggagcacgcc gaggtaggtg ggatagcagc gtccgccgcg gatgctggcg 
cgc acgta g t cg taC a t ctc gt gc gagggg g c ga gg aggc cgcctccgag gtcgccgcgc 
tgcggtctga cggcccggta ggtgacctgg cggaagatgg cgtgcgagtt gg a gg a g at g 
gtgggccgct ggaagatgtt gaagctggcc tcggggagtc cgacggcgtc gtggacgaac 
tgggcgtagg agtcgcgcag cttctgcacg agcgcggcgg tgacgagcac gtccagggcg 
cagtagtcga gggtctcgcg gacgaggtcg taacggggct cttgcttctt ttcccagagt 
tcgcggttga ggaggtactc ctcgcgatcc ttccagtact cttcggccgg aaagccgcgt 
tcgtccgcca ggtaagaacc cagcatgtag aagcggttga cggctcggta gggacagcag 7260 
cccttctcga cgggcaggga gtaggcctgc gcggccttcc tgagcgaggt gtgggtgagg 
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3 c 9 aa 9gtgt cgcgcaccat gaccfctgagg tgaagfccggt gtcgtcg ^ 

9 c 9 cccc gct ccca gagccc gtagtcggtg cgtttctggc tgcgggggtt gggcagggcg 
a aggt9 ac 3t cgttgaagag gatcttgccg gcgcgcggca ^^^^^ 
aa^cccc. gcacgtccga gcggttgfcta atgacctggg 
aagccgttsa tgttgtggcc gacgatgtag agcg<igggcg 
tt993g3 cc t tcttgagctc ctcgtaggtg aggcagtcgg gcgagfcagag 

t9tCggg ccc at t c ggC cac ctgggggttg gcttgcaaga agccccgcca 
9 c 9 a 9 c tggg tctggaggcg gtcgcggtag tcgcggaacfc ttttgcccac ^^^^^ 

tc 99gggtga ccac 9tagaa ggtgcggccg tcctggcccc aggcgtccca gtfcctgctcg 
c 939 c gagac ggcaggcctc ctcgacgagg gcctcctccc cacgact ^ 
a t9 aa gggga cgagttgctt gccgaaggca cccacccacg ^^^^^ tacgtcgt ^ 
9 t9 ac 9aaga gacgttcggt gcgaggatgc ^^^^ gaaagaagtt gatctcccgc 
c a =c agccgg aggagtgggc gttgacgtgg fcggaagtaga agtcacgccg gcgga ^ tg 
cac tcgtgct gatatttgta aaagcgggcg cagtactcgc agcgctgcac gctctgcact 
tcc tgaacga gatgcacccg gcgcccgcgc ^ 
g ct tC3g , g c gctgtccttc agcctcgcca . tgctcctctg . cacctgcacg ctcctgctgt 
99 3t99 a 9ga = ggagggagt gacgacgccg aggtccagat 
33 = 93 oc tga ggctcagcgc cagggtgcgg atctgagcgg ^^^^ 
3 cc t c gCtga ggtcgacggg cagcgtccgc 

33 c 9 cc aggc gctgatggta cttgatctcg agcggttcgt fcggcggaggt 
ta g a 3 c aggg cctgaccgcg ggcggcgacg atggtgccgc ggtgccggcg gcaggtggcg 
ta tt c 9gggg ggct c 9tta c a t caccc 3 cc t ggg cc tggc g cc ggg c ggc a 9 c gg3ggtt 
c t9gC ccc g c c ggcatgggc ggcag c ggca c gCcg9 c g c 3 gggct cc ggc a gcggctggt 

; 9 c t9ag c tcg cagctgactg gcgtgcgcga cgacgcggcg gttgaggtcc tggatgtgcc 

ccc gctgcgt gaagaccacc ggtccccgga ctcggaacct gaaagagagt tcgacagaat 

caa t ctc gg c atc 9ttgacg gccgcctgac gcaggatctc ct 9 cac gt c g ccc 9 a 9 t tgt 
cc t9gtaggc gatctcggac atgaactggfc cgatctcttc 

c 99 c gcgtt o gacggtggcc gcgaggtcgt fcggagatgcg ^^^^ tgggagaagg 
c 3t t gaggC c gttct c gCt c cacac gcg ac tgtagacgac gttgccgacg gcgtcccggg 
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c 3 c 9 cat g ac cacct gcg c g ac 9 tt 9 a gct ccac g t gtCg c g c 9 aa 9 ac 9 9 c 9t a 9ttg c 
gca ggC9 ct g g aa ga99t a 9 tt g a ggg t gg t g3 c gatg t 9 ctcgcagacg aagaagtaca 
tgacccagcg gcgcagcgtc atctcgttga t gt ctcc g a g ggcttccaag cgctccatgg 
cctcgtagaa gtcgacggcg aagttgaaga ac tgggag tt gcgcgccgcg accgtcagct 
cgtcttgcaa gagccggatc agctgggcca cggtctcccg cacctcgcgt tcgaaggccc 
ccggcgcttc ttcctcctct ggttcctcgg cggcctcttc ttccatgacg gcttcctctt 
cctcc ggttc ctcgggcacg gg cc t cc ggc 99 c 9 ac 3g c g cctgatgggc a ggcggt cca 
cgaagcgttc gatgatctct cc gcggcgg c ggcgcatggt ttcggtgacg gcgcggccgt 
tctctcgggg ccgcagttcg aagacgcccc cgcgcaggcc gccggcgccg cc gagagggg 
9ca 9gaggtg gggg ccttc g ggcagcgaga gggcgctgac gatgcaccgt atcatctgtt 
gcgtaggtac agctctccag gagtcgttga gcgagtccag ttggacggga tccgagaact 
tt t c 9 a gg aa a g c tt c gatc caatc g ca gt c g caa gg taa g ct gagg ac g g t gggatgag 
9 3 gctt gg c g ggaggcggag gcggcagaag agga 9gagg a gggcaggctg g a ggtgatg c 
tgctgatgat 9t aa tt9 aa g t a ggcggttt tcaaacggcg g at ggCggcg a ggagg ac g a 
cgtctttggg cccggcctgc tggatgcgca ggcggtcggc catgccccag gcgtggctct 
99 catc gg c g caggtccttg tagtagtctt 9 ca tg a gtct ctcgacgggg acgtcgtctt 10020 
cgtcggcccg gtcggccatg c gggtggagc cgaacccgcg caggggctgc a g ca gg gcca 10 080 
9 g t= gg cgac cac g c g ttc g g cca 9 cac gg cct 9 ct gg a t ctgggtgagc gtggtctgga 10140 
agtcgtccag gtccacgaag c ggtggtagg agcccgtgtt gatggtgtag gtgcagttgg 10200 
cca tg ac 3ga ccagttgacg actt g ca tgc cgggc tgggt gat c tcgg t g t a g c g9 a g9 c 10260 
g c 9agt a g9 c ccgcgactcg aagacgtagt cgttgcaggt gcgcacgagg t act gg ta 9 c 10320 
cgacgaggaa gtgcgg c gg c ggctcgcggt a g a ggggcC a gcgcacggtg gc gggggcg c 10380 
cg 3ggg cca g gtcctccagc at g a ggcggt ggtagtggta gacgtagcgc gagagccagg 10440 
t9at9CC " C ^cgga 9gtg gcggc 9 c ggg c gaa g t c gcg gac g c ggttc ca g at gttgc 10500 
gcaagggggo gaagcgctcc at ggtgggca cg ctc Cgg cc ggt gagg Cgg gcg ca gt cct 10560 
g cac 9 ctc t a gacgggacag agagcgggag gttagcggct ccgctccgtg 9 cct ggggg a 10620 
cagaccgcca gggtgcgacg gcg gggaac c cc 9gt tc gag accggctgga tccgtccgtc 10680 
cccgacgcgc cggccccgcg tccacgaccc caccagaggc cga g accca g ccgcggtgcc 10740 
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cggaccccag atacggaggg gagccttttt gtggtttttt cccgtagatg catccggtgt 10800 

tgcgacagat gcgtccgtcg ccagcgccgc cgacgcagcc gccgctcccg ccccccacca 10860 

gcgccgcgga ggctctgtcc ggcggccgcg gcgacccgga ggaggaggcc atcctcgact 10920 

tggaagaagg cgagggcctg gcccggctgg gagcgccctc ccccgagcgc catccccgcg 10980 

tgcagctggc gagagactcg cgccaggcct acgtgccgcc gcagaatctg ttcagggacc 11040 

gcagcggcca ggagcccgag gagatgaggg accgcaggtt tcacgcgggg cgggagctgc moo 

gcgcgggctt cgaccgtcgg cgggtgttgc gcgccgaaga cttcgagccc gacgagcgca 11160 

9cggag t a a3 tccggcacgg gcgcacgtgt cggcggccaa cccggtgacc gcgtacgagc 11220 

agacggtgaa cgaggagcgg agctttcaga aaagcttcaa caaccacgtg cgcaccctga 11280 

tcgcgcgcga ggaggtggcc atcggcctga tgcatctgtg ggactttgtg gaggcgtacg 11340 

tgcagaaccc gtcgagcaag ccgctgacgg cgcagttgtt cctgatcgtg cagcacagtc 11400 

gggacaacga gacgttccgc gaggcgatgc tgaacatcgc ggagcccgag ggccgctggc 11460 

tcttggaccc gattaacatc ctgcagagca tcgtggtgca ggagcgcagc ctgagcctgg n 5 20 

ccgacaaggt ggcggccatc aactacagca cgtcgagcct gggcaagttt tacgcccgca 11580 

agatccacaa gagcccctac gtgcccatag acaaggaggt gaagatcgac agcttttaca 11640 

tgcggatggc gctgaaagtg ctgacgctga gcgacgatct gggggtgtac cgcaacgacc . 11700 

gcatccacaa ggccgtgagc gccagccgcc ggcgcgagct gagcgaccgc gagctgatgc 11760 

acagcctgcg gaggg Cgctg gcgggcgccg gcggcggcga ggaggccgag tcctacttcg 11820 

acatgggggc ggacttgcag tggcagccca gcgcgcgggc cctggaggcg gcgggctacc 11880 

gcggcggcgg cggcgtggtc gaggcggagg acgaggacga ggtggagtac gaggaggagg 11940 

actgatcggc gaggtgttct cgtagatgca gcgcgcgacg gcggcggcga gcgggccgca 12000 

gggggacccc gccgtgctgg cggccctgca gagccaacct tcgggcgtga acgcctccga 12060 

tgactgggcg gcggccatgg accgcatttt ggccttgacc acccgcaacc ccgaggcctt 12120 

tagacagcag ccgcaggcca accgcttttc ggccatcttg gaagccgtgg tgccctcgcg 12180 

caccaacccc acgcacgaga aggtcctggc ggtggtgaac gcgctgctgg agagcaaggc 12240 

gatccgcaag gacgaggcgg ggctgattta caacgccctg ctggagcggg tggcgcgcta 12300 

caacagcacc aacgtgcagg ccaacctgga ccgtctgacg acggacgtgc gggaggcggt 12360 

ggcgcagcgg gagcgcttca tgcgcgacac gaacctgggc tcgcaggtgg ccctgaacgc 12420 

cttcctgagc acgcagccgg ccaacgtgcc gcgcgggcag gaggactacg tcagtttcat 12480 
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ca 9 c 3cgct9 c 9 cc tcCtg9 t99 cc 9a99t . 9ccgcagagc ga9gtgtacc agtcgggtcc 12540 

.gactacttc ttccagacct c 9 c g9 ca 993 cctgcagacg gtgaacctga cgcaggcctt 1260Q 

caagaacctg gaag g ca tg t ggggegtgcg ggceccegtg ggcgaccggg cgacgaCctc 12660 

cagcttgctg acgccgaaca c 9 c gg c tgct gctgctgctg atcgcgccct tcaccaatag 12720 

cagtaccatc agccgcgact cgtacctggg ccacctgatc acgctgtacc gcgaggccat a2780 

cgggca ggcg ca 9gtggacg agcagacctt ccaggagatt acgagcgtga gccgggccct 12g40 

ggggcagcag gacacg g9ta g cc t g gaggc gacgctgaat tttctgctga ccaaccggcg „,„ 

gcagaagatc ccctcccagt acacgctgag cac ggaggag gag c g catc t t g c g ctacgt 12,60 
gcagcagtcc gtgagcctgt atct gatgcg c g a 333ggcg agcccctcgt cggcgctgga 

catgacggcc cgtaaca tgg agccgtcgcc gtacgcggcc cacc gg cc gt tcgtgaaccg 13080 

cctgatggac tacctgcacc gcgccgccgc ca tg aac 3g c g a g tacttta cgaacgccat 13140 

cctgaacccg cactggatgc cgccgtccgg tctctacacg gggg ac Ct t 3 acat 9 ccc 9 a 13200 

gggcgacgac gggttcctgt gggacgacgt gt c 33 aca 3 c gtg ttc g cgc cgg t gcg tC c 13260 

gggcaagaag gagggcggcg acgagctgcc gctg.ccgtg g t gg aggcgg cgtcgcgcgg 13320 

ccagagcccg ttccccagcc tcccgtcgtt. gt c 3gcg a g c agcagcagcg g ccggg tct c 13380 

gcgcccgcgg c tgg a gggcg accacccgaa c g accc gctg ctgcgccccg cccggcccaa 13440 

gaactttccc aacaac 9ggg tggagagcct a gtggataag atgaatcgct ggaagaccta 13S00 

cgcccaggag ca g c 99gagt gggaggagag tcagccccgc cccctgcctc cgccgcgctc 13560 

cagg tggC gc cg gcggg aa g aa g accc gg a a g actc gg c g g ac g ata 9 ca 9 c g t gttg9 a 13620 

ctt 9gggggg acc ggtg cc g cctcgacaaa ccc 9 ttc g cc cacctgcgcc cgca ggg ccg 13680 

gct 9ggtcgg ctgtattgag gaaagaaact aataaaagaa aaaagagctt gcttaccaga 13740 

g ccat ggtcg ca g c gt c 99 t ccc tttgtgt gtgttttctc ctccccggta . gcgaa afcg ^ 
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agg aac age ate cgt tac tea gag ctg gcg ccg ctg tac gac acc ace 13990 
Arg Asn Ser He Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr 

55 60. " 65 

Ard Si Hi rtf, T C tCg 9C9 9aC atc 9C9 tC9 ct 9 aac "038 

Arg Val Tyr Leu Val Asp Asn Lys Ser Ala Asp He Ala Ser Leu Asn 

™ 75 80 

tac cag aac gac cat age aac ttt ctg acc acg gtg gtg cag aac aat 
Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn 
85 go 95 

gac ttt acc ccg gtg gag gcg ggc acg cag acc ata aat ttc gac gag 
Asp Phe Thr Pro Val Glu Ala Gly Thr Gin Thr lie Asn Phe Asp Glu 
100 10S 110 

cgc teg egg tgg ggc ggc gac ctg aaa acc atc ctg cgc acc aac atg 14182 
Arg Ser Arg Trp Gly Gly Asp Leu Lys Thr He Leu Arg Thr Asn Met 
115 120 125 

ccc aac atc aac gag ttc atg tec acc aac aag ttc agg gec egg ttg 
Pro Asn lie Asn Glu Phe Met Ser Thr Asn Lys Phe Arg Ala Arg Leu 
130 135 140 



145 



14086 



14134 



14230 



t J S? n? 9 f ?, ? 339 933 aCC aat 9CC CCt c 9 a tac 9*3 tgg 14278 

Met Val Glu Lys Val Asn Lys Glu Thr Asn Ala Pro Arg Tyr Glu Trp 

150 155 160 

It* Hf 9 " C a ^ C Ct9 ccc 939 " c aac tac tc S 9ag acc atg acc ata 14326 
Phe Glu Phe Thr Leu Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr He 
1«5 170 175 

gac ctg atg aat aac gcg atc gtg gac aac tac ttg gaa gtg ggg egg 14374 
Asp Leu Met Asn Asn Ala He Val Asp Asn Tyr Leu Glu Val Gly Arg 
180 185 190 

cag aac ggg gtg ctg gag age gac atc ggg gtg aag ttt gac acg cgc 14422 
Gin Asn Gly Val Leu Glu Ser Asp He Gly Val Lys Phe Asp Thr Arg 
195 200 205 

til dk C a" f t9 ?.? C t99 9aC CC9 9t<= aCC aa9 Ct9 9tc at 9 ccc 99C 14470 
Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Lys Leu Val Met Pro Glv 

210 220 22S 

?/? I* 6 aaC 939 9CC ttc cac ccc 9ac afcc 9 tc cfc g ctg ccc ggc 14S18 
Val Tyr Thr Asn Glu Ala Phe His Pro Asp He Val Leu Leu Pro Gly 
230 235 240 

lit r?! S t9 ?*° fc w° a ° 9 CaS 39C C " Ct9 a9C aac ct9 cfc g g99 atc 14566 
Cys Gly Val Asp Phe Thr Gin Ser Arg Leu Ser Asn Leu Leu Gly He 

245 250 255 

cgc aag egg atg ccc ttc cag gcg ggt ttt cag atc atg tac gag gac 14614 
Arg Lys Arg Met Pro Phe Gin Ala Gly Phe Gin He Met Tyr Glu Asp 
260 265 270 
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ctg gag ggc ggc aac ate ccc gec ttg eta aac afn «^r. * 

Leu Glu aiy Gly Asn He Pro Si. Leu Su Sp S? KS Lys S Glu 

Ila sir no Gin tt 9 f? 9 t" 9 f 9 039 " C Ca9 atc c 9«= 99C 3 ac 
Ala Ser lie Gin Lys Ala Ar 9 Glu Gin Gly Gin Glu lie Arg ofy lap 

295 300 305 

= S 5 S £ £ S 5 51! S! £ S S S! H £ 



315 320 



gat age aag gac cgc agt tac aac eta etc ccc ggc gac cag acc aac 
Asp Ser Lys Asp Arg Ser Tyr Asn Leu Leu Pro G?y Asp Gin Thr Asn 
■" 5 33 ° 335 

Th? !?! ^ C * 9 ° a9 ° t99 ttC Ct9 9 CC tac aac tac gg C gac ccc gaq 
Thr Ala Tyr Arg Ser Trp Phe Leu Ala Tyr Asn Tyr G?y Asp Pro Glu 



345 3 50 



ft 9 n 9C f,^ 399 tCC t99 ac9 ctg ct 9 acc a cc acg gac gtc acc tac 
Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Th? Asp III Thr Cys 

360 3 65 

Gly III Gin Gin V=? tC9 " C CC9 93C at9 at 9 caa 9 a <= ccc 

Gly Ser Gin Gin Val Tyr Trp Ser Leu Pro Asp Met Met Gin Asp Pro 



380 



385 



a i fc IZ C t? C 9tc agt 9 aa aa c gtt ccc gec etc aca gat cacqqa acc 
.11. Thr Thr Val Ser Glu Asn, Val Pro Ala Leu Thr Asp £is G?y Thr 



455 460 



465 



14662 



14710 



14758 



14806 



14854 



14902 



14950 



14998 



Val Thr 111 1" t CC tCC a9C Caa 9CC a9C aac tac ccc gtg gtg gga 
Val Thr Phe Arg Pro Ser Ser Gin Val Ser Asn Tyr Pro Val Val IS 

390 395 4 oo 

e s £ s s s s s 5 = ~ «- « H a s 

40S 410 415 

Va? Tyr Ser Sn Leu *^ ?" ^ 9 3CC 9C9 CtC * C * cac 9 tc "c 15094 
vai Tyr Ser Gin Leu He Arg Gin Ser Thr Ala Leu Thr His Val Phe 

420 425 430 

= 5 s s is s a s-2 a s s s s s 



15190 



15238 



ctg ccg ctg cgc age agt ate agt gga gtc cag cgc gtg acc ate act 
Leu Pro Leu Arg Ser Ser He Ser Gly Val Gin Arg Va! Thr lie ?hr 
470 475 460 

gac gec egg cga agg acc tgc ccc tac gtg cac aag gec cto aac ata 
Asp Ala Arg Arg Arg Thr Cys Pro Tyr Va? His Lys Ala Leu lly lit 
485 490 J 



495 
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« s s e s £ ss * * - - ». 

500 505 



cattctcatc tcgccc 9 aca acaaoa=c 93 c tg33gc ct g c g ctc gg cc 9 g cat g tac 9g 15392 

C " C9CCaa9 C99C9CtCCa c 9 c 93 ccac t acc 9g9C ccc 15452 

c tggggcg cc cacaa 9C9Cg gcgtctccac gcgcaccacc 9 tc 9 ac 9 ac g cca t c 3 ac 9 c X55 l2 

c gt c 9t99C c ca 99 cca g ac 9 c t acc 9 cc 9 9CC caa 9t c 3 ac 9gtg9 a Cg cc gt ca t c 9 a 15 S7 2 

ca 9 c 3tggtg 9ccgacgcgc ggcgatacgc tcgacgcaag ^^^^^ accgccgtcg i5g32 

c Cg tcccacc gcc9C ca t9 c t ggcCg cca g a gcggtC ct 3 a 3 ac 9 c gcg e 9 cc 9 c 9tggg 15692 
cc 9 cc g a g cc a tgcg cc g a g cc gcgg ccaa c g cca gcg c g ggtC9 c 9 ccc gtCgt ca ggc 15752 

C9CCC93Ca9 ******* ~«<*ccaa cc tggc ccaa cccc 9 cc 9gg g aaac 9 t gt a „.„ 
c tgggtg c 9 a g ac gc9tcgg gcgtgcgcgt gccggtgcgc acccgccccc ctcggagtta i5872 

9 aa 9 acaaaa a 9 ac 99 ac 9 a a 9 ac tg a 3tt tccctgtcgt tgccagcatg agcaagcgca ^ 

a 3 ttcaaa 9 a a 9 a g ct gctg gaggc cc t c g tg ccc gag at c t ac 99 ccc 9 9 cc 9 cc 9 ct 9 15992 

cc 9 cc 9 c g9 t ggcgg ac gt c aa g ccc 3 aa 3 tt aa 3 cccc 3 c 3 c 3 ct 3 aa 3 c 99gtt aaaa 16052 

a 9 c ggg aaaa g aaa 9 a 9g a 9 aa 9 ca 9g aa g ca g39ttgct agacgtcgac gacggcgtgg ^ 

a 3tt c gt9 c 9 9t cc ttcgC9 cccc 9tC9 cc 93gtg ca gtg 9 c gg9gtcgc cgcgtcaagc 16l72 

t c 9t ccc gcg gcC999C acc 9tggtgtctt tcacccccgg cctgcgttcg gccacgcgcg i6232 

9cc Cg aa gcg c g a gtacg ac g a ggtctaCg gcg ac g aa 9 a catcct 99 a 3 ca 99 cc 9 ccc 16292 

a 9 ca 9 ctc gg 33 a 3ttt9ct tac 9gC aa 3 c gcgg cc gct a c gggg a 9gtg 9 c 9 c tgg c 9 c 16352 

t 99acca 9gg caatcccacg cccagcctca a g ccc gt cac gCtg ca 9 ca 9 gCg c tg ccc g 1MU 

t9 a 9 c 9 c 9tc gaccgagagc aa g c g339 ca tcaa 3 a 333 a 3 at 9ggcg ac c tg ca 9 ccca l 6472 

ccatgcaact cat ggCgC cc aaac 93 ca 3 a a 9 ct g9 a gg a c 9 t 9 c tgg a g aacat g aaa g 16532 

t 39 atccca 9 catc 9 a 9 ccc 9 aa 9 t g aaa 3 t3 c 3 acccat caa 99 aa 9tg 9g ccc 99g cc 16592 

ta ggcgtgc a 3 ac ggtgg ac at t ca g atcc cc gtg c g c g c ctcccccgtt tc tg ccacca . 16652 

c t ac 9 ac ggc c 9tggagg cc at g3 aaac 3 c a 3 ac 33 a 3 c t 3 ccc gcgg cc tt 99 c 99 ca g 16712 

cc 9 ccacc 9 c c 9 cc gcgg ct accc gag a 3 a t 933 cat 3 ca gaccgacccc t ggt ac 3 a 3 t 1677 2 

tc 9 cc 9g ccc c 3 ccc 3 tc gt ccacgagccc g tc 33t ac gc 99 c 9 accacc t ccc ggc tcc i 6832 

ctgactacgt cttgcatcct tccatcacgc c 9 ac 9 ccc g3 ctacc 3 c 99 a acgaccttcc 16892 

gcccc ggt c 3 c g c gcg cacc accacccgcc gtcgt c 3 cac caccc 3 cc 3 c c g tc gC a g cc 16952 
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gtcgcgcact ggct ccca t c gcgg ttcgcc gcgtcgtccg ccggggfccgc ^ 

tscccaccsc gcgttaccac cccagcatcg tMtttaacc tgcgctgccg ^^^^ ^ o72 

9gctc t gacg tgccgctttc 9cttccccgt tcggcactac ^^^^^ ^^^^^ i7 ^ 2 

gactggtcta gcgggcagcg gtctccgacg cc 9 cc g cc 3 c. geggtgc.cc ggcgcatgaa 17192 

gggcggcatt ctgccc9cgc tgatccccat tatcgccgcc gccatcgggg ^^^^ i7252 

cgtggcctcg gtggccttgc aagcagctcg caaaaatfcaa ataaagaagg ^ 

actgcctggt cctgactg tt tcatgcagac aagacatgga agacatcaat tttgcgtcgt 17372 

tggccccgcg gcacggctcg cggccgttca tgggcacctg gaacgagatc gg cacca g cc !743 2 

agctcaacgg gggcgctttc ag ttggag ca gcct gtg g ag cg9cattaaa aactttgggfc ^ 

ccacgattaa g acc t a tggc aaC aa ggcgt ggaacagtag cactggtcag atgctccgcg 17552 

a t aa gctgaa ggaccagaac ttccagcaga aagtggtaga cggtctggcc • Ccgggcatca ^ 

ac g ggg tggt ggacctgg cc aacca ggcgg tgcagaacca gatcaaccag c g tc tggag a 17672 

acagccgcca gccgcccgcg gccctgcagc agcgtccgca ggCggaggag gtggaagtgg 17v32 

aggagaagct gccgcccc tg gagacggtgt C9Ccg9tggg cgtgcctagc aaggggggga ^ 

agcggccgcg gcccgagctc gaggagaccc tagtgaccga gaccctggag ccgccctcgt 17852 

acgagcaggc Cttgaaagag ggggccacgc ccctgcc<=aC gacccggccc atcggaccca ^ 

t g9 cccg a cc ggtCCacgg c aa ggaaca ca aagccgtgac gctagagctg cctccgccgg 17972 

cgcccacc gt a CCC cc g a t g cccg gtccca ccctgggcac cgccgtgcct cgtcccgccg 18032 

cccc 3 cc ggt c gccg tgg C c acgcccgcgc g ccc g ag tC g cggagccaac t gg cagagca 18092 

ctctgaacag catc gtgggc ct gggagtga aaagcctgaa acgccgccgg Cgttactatt ^ 
aaagccagct aaatacccat gtgttgCatg cgcctcctgt gtcacgccag aaaaagccag 

ccgagtgacg ggtcaccgcc gccgccaaga gcgccgcttt caag atg gcc acc ccc 

Met Ala Thr Pro 



18268 



510 



s ~ '£ » = - £ S a s tit ui s; a z; s "™ 

520 525 

2 a 2 2 s si 2 a a 2 K si s = 5 — 

535 540 

3 2 2 IS 2 5 £ 2 S 3 S £ s 51? Ill S 
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545 550 555 



560 



acc cac gac gtg acg acg gac egg tec cag egg ctg acg ctg egg ttc 
Thr Hi. Asp Val Thr Thr Asp Arg Ser Gin Ar| Leu Thf Leu Arg Pne 
565 S70 57 | 

gtg ccc gte gac cgc gag gac acc gcg tac teg tac aaa gtg cgc ttc 
Val, Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr Lys Va! Arg lit 
580 585 590 

2 111 E? !!- ?.*! ? t9 ^c atg gec age acg tac 

Ala 
605 



;C 7 T 3 3 ^ yc "- aac C 9 C 9 C 9 ct 9 gac atg gec aqc acq tac 
Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ila sir Thr Tyl 
595 600 



ttt gac ate cgc ggc gtg ttg gac cgc ggt ccc age ttc aaa ccc tac 
Phe Asp He Arg Gly Val Leu Asp Arg Gly Pro sir Phe lys Pro Tyr 
bi0 615 620 

Ser Glv ?hr 2? C l*° t a ° tCC Ct9 9CC CCC aa 9 99 c 9 CC cc <= •« ccg 
Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Pro 

625 630 635 640 

tea gaa tgg aag ggc tea gac aac aaa att agt gta aga qqt cag act 
Ser Glu Trp Lys Gly Ser Asp Asn Lys lie Ser Val Arg Ify Gin Ala 



«45 6S0 



ccg ttt ttt agt aca tec att aca aag gat gqt att caa at- a „rr. 
Pro Phe Phe Ser Thr Ser He Thr Lyf V P Sy ill SS III lit Thr 
660 «5 670 • 

gat act tct age gga get gtg tat get aaa aag gaa tat cag cct qaa 
Asp Thr ser Ser Gly Ala Val Tyr Ala Lys Lys Glu Tyr Gin Pro llu 



680 68 5 



cca caa gta ggg caa gaa caa tgg aac age gaa gec agt gat aqt aat 
Pro Gin Val Gly Gin Glu Gin Trp Asn Ser llu Ala Ser 2p Ser Sp 
0 695 700 

aaa gta get ggt agg att eta aaa gac aca aca ccc atg ttc cct tat 
Lys Val Ala Gly Arg He Leu Lys Asp Thr Thr Pro Met 111 Pro Cys 
05 710 715 729 

tac ggt tec tac gee . aag ccc aca aat gaa cag ggg ggg C aa ggc act 
Tyr Gly Ser Tyr Ala Lys Pro Thr Asn Glu Gin Gly Gly tin III Tnr 
72S 730 735 

aat act gta gat ctg cag ttc ttt gec tct tea teg get acc tct aca 
Asn Thr val Asp Leu Gin Phe Phe Ala Ser Ser Ser III Thr Ser tS 
740 745 750 

Pro III 111 , CtC l 3t 9 f ° 939 93C 9Cg gca ata aaa 9= a cca gac 
Pro Lys Ala Val Leu Tyr Ala Glu Asp Val Ala lie Glu Ala Pro Asp 
755 7 6 o , 65 

acc cat ttg gtg tac aaa ccg gca gtt aca acc acg acc act agt tec 
Thr His Leu Val Tyr Lys Pro Ala Val Thr Thr Thr Thr Thr Ser Ser 
770 775 780 
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caa gac ctg eta act cag cag get get ccc aac cga ccc aac tac att 19132 
Gin Asp Leu Leu Thr Gin Gin Ala Ala Pro Asn Arg Pro Asn Tyr lit 

795 800 
ggc ttc agg gat aat ttt ate aat etc at-a t-^ - 

Ply Phe. Arg Asp Asn Phe He c? y Leu S £ £ Asn Ser Thr G ?y 1918 ° 
805 • BIO . 815 

aat atg ggt gtt ttg gca ggg caa get tct cag eta aac acq ate att 
Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn IS Val ?S 

®20 one 

tf^s 830 

2 2 = 2 5 2 2 H 2 2 2 2 2 2 2 1S2 ' S 

835 840 845 

get ttg ggc gac cgc agt cgt tac ttc tec atg tgg aac caa acc ata m , a 
Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Mel Tr? Asn Gin Ma Si 

855 8 60 

2 S 2 2 2 2 S 2 S S E 2 S 2 S S! 



2 B 2 2 £ 2 2 S 2 £ 2 2 2 2 2 2 

885 



895 



19468 



n,* ff fc I at aCa " C cta tca ccc caa aa c gga agt aac acq taa aca 
Glu Thr Tyr Thr Gly Leu Ser Pro Gln Asn Gly sir tin Thr Trp Thr 

900 9°5 910 

S 2 2 5 2 2 E 2 2 2 » K 2 2 2 2 19516 

920 925 

2 2 2 2 2 IS 2 2 2 5 E 2 2 22 2 19M < 

935 940 

Pne Leu Tyr Ser Asn Val E r t9 " 9 CCC 9aC 939 taC aa * ctc "612 
gVI Tyr Ser Asn Va * Ala Leu Leu Pro Asp Glu Tyr Lys Leu 

950 95S 960 

ita Pro lit Air, t? C ^ f tC CC ° 9aC 3aC aaa aac act "c 9«c tac 19660 
Thr Pro Asp Asn lie- Thr Leu Pro Asp Asn Lys Asn Thr Tyr Asp Tyr 

965 970 975 

m C ? » aC 2? C C9C 9t9 9CC acc ccc age tec etc gac acc tac qtc aac 1970S 
Met Asn Gly Arg Val Ala Ala Pro Ser Ser Leu Asp Thr T^r Va? Asn 
9B0 98S 990 

lie cfy H A« iS Ser Pr« P° S* ,t9 «" «" 916 aac CCC ttc 19756 
y o„ Arg Trp Ser Pro As P Pr ° Met Asp Asn Val Asn Pro Phe 

1000 1005 



139 



= ==5 = B£255:S£ ssa 

1015 1020 
ggc aac ggc cgc tac gta ccc ttc car am ~* 

oi, s «, „ 9 nr ? al « s s s a s a 5: 

1030 1035 

-£22222 222211? 222 

1045 10S0 
tac gag tgg aac ttc cgc aag gac qtc aac at-a * + ~ 

1*. oiu s t„ Pte A ? g £ 11 g « a s « s = 

1050 1065 

1075 10B0 

1090 1095 
1,° e C ° CtC 9ag ^cc atg ctg cgc aac gac 

1105 1110 



acc aac gac 
Thr Asn Asp 



a 2 2 =5.S 2 SK ffi £ fi ~ - - 



1135 



1140 



1150 1155 

5 SS 2 2 £ 2 5 2 S5 2 S 2 2 2 2 

1165 1170 

22^22222211^22 222 

1180 1185 

1190 f?L et Phe Asp Ser Ser Val Ser 

1195 1200 

SSffiSKSa 25ESSB SK5!£ 

1210 1215 

aag cgc acc gtg gac ggg gaa ggg tac aac gtg gec cag tgc aac 



19801 



19846 



19891 



19936 



19981 



20026 



20071 



20116 



20161 



20206 



20251 



20296 



20341 



20386 



20431 



140 



^ So A8P . Gly gj s ^ T ^ A - VU Ala Gln Cy8 Asn 

* 225 1230 

1240 1245 

ss'sasssssHsjs: ass 

1255 1260 

s s = s s * = s s s a a s a a 

1270 1275 
128S 1290 

Phe G?n » 3C ! aC tC9 999 ttc 99a tac atg ggc ccc acc 

Sis " ?' y PhS Val Gly ^ "yS Thr 

1300 1305 

M*? ? 39 5? 9 Ca9 9CC tac ccc acc aa ^ tac ccc tac ccc eta 

Met Arg Glu Gly Gin Ala Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu 

1315 1320 

1330 1335 

1345 1350 

Ser Me? III !? 9 f tC 9a ° Ct9 999 Ca * aac at 9 etg tac gec 

Ser Met Gly Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala 

1360 1365 

ser E H C SjES tC ! IC at g acc ttc gag gtg gaccccatg 
f!!L Ala HXS Ala Leu As P Thr Phe Glu Val Asp Pro Met 

1380 



1370 



1375 



1405 1410 

ft* t SC *u 9 CC9 ttC tC9 * cc 99t aac gec acc acc 
Leu Arg Thr Pro Phe Ser Ala Gly Asn La Thr Thr 

1415 1420 1425 

gecgaeggat gggctccagc gagceggage tggtcgecat cgcgcgcgac ctgggctgcg 



20476 



20521 



20566 



20611 



20656 



20701 



20746 



20791 



20836 



20881 



20926 



20971 



taa ggagggggee 21020 



21080 



14] 



ggccctactt cctgggcacc tttgacaaac gcttcccggg cttcgtggcg ccgcacaagc 21X40 

tggcctgcgc catcgtcaac accgccggac gcgagaccgg cggcgtccac tggctggccc 21200 

tggcctggaa cccccgcagc cgaacctgct acctcttcga ccccttcggc ttctcggacg 21260 

acaggctcag gcagatctac cagttcgagt acgaaggccc gctccggcgc agcgccctcg 21320 

cctccacccc cgaccactgc gtcaccctcg tcaagtccac ccagaccgtc caggggcccc 21300 

gctcggccgc ctgcggcctc ttctgctgca tgttcctgca cgccttcgtg cgctggcccg 21,40 

cctcccccat ggacggcaac cccaccatgg acctccttac gggcgttccc aacagcatgc 21500 

ttcagagtcc ccaggtcgag cccaccctcc accgcaacca ggaggaactc tacgccttcc 21560 

tggctcggca ctccccctac tttcgccgcc accgcgagcg catagaaaag gccaccgcgt 2162 0 

ttgacaaaat gaacgactag attttctgtg aaaaacactc aataaagcct ttattggttc 216B0 

accacacgtg cacgcatgca gactttttat ttaaaagggc tccgcctcct cgtcgccgtg 21740 

gctggtgggg agggagacgt tgcgatactg caggcgggag ctccatctga actcgggaat 21800 

cagcagcttg ggcagggggc cctcgacgtt ctcgctccac agcttgcgca ccagctgcag 2X860 

ggcgcccagc aggtcgggcg.cggagatctt gaagtcgcag ttggggccct ggttgccgcg 21920 

ggagttgcgg tacaccgggt t gg C gcac t g gaacaccagc acgc t gggg t . gctcgatgct 21980 

ggccagcgcc gtcttgtcgg tcacctcgtc gccgcgcagg gactccgcgt tgctcagcgc 22040 

gaaggcggtc agcttgcaca gctgccgacc cagcacgggc accccgctcg gctggttcag 22100 

9cag tC gcag cgcatagcca tcagcagccg cttctgcccg tgctgcatct tcggatagtc 22160 

93Ctcgcatg aaggcctcca tctgccggaa ggccgtctgc gccttgctgc cctccgagaa 22220 

gaacagcccg caggacttgc cggagaacac gttgttgccg cagctcacgt cttccacgca 22280 

gcagcgcgcg tcgtcgttct tcagctgcac cacgctgcgg ccccagcggt tCt gcaccac 22340 

cttggtcttg ccgggatgtt ccttcagggc ccgctggccg ttctcgctgg tcacgtccat 22400 

ctccaccacc tgctccttct ggatcatctc cagcccgtgg tagcagcgca gcacgccctc 22460 

ctgctcggtg cacccgtgca gccagacggc gcagccggtc ggctccagot gttgaggttt 22520 

caccccggcg taggtctcca cgtacgcccg caggaagcgg cccatcatct ccacaaaggt 22580 

cttctgaccg gtgaaggtca gctgcagccc gcgatgctcc tcgttgagcc acgtctgaca 22640 

gatcttgcgg tacaccttgc cctgctcggg cagaaacttg aaagcggcct tctcctcggg 22700 

ctccacgtgg tacttctcca tcagcgccga catcagctcc atgcccttct cccaggccga 22760 

caccagcggc tccgcgcggg ggttcaccac cgccatgcct cgggaagtgc cggggcgctc 22820 
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tectc.tc* « t « toetg aggcggcggc s3c9gca9tt g ^ 2!<m 

— 9 c= 9 « 99cctCcc ca=9ccgggg tg9gt9Mcc cgtgggccac 2j8io 

— «*, t c«« tccc cttcgctgtc ggg „ egaet tcgggag>99 !jom 
a^acc^ 9c99ccactg cggcca(cgc afwectt> ^ 

a*™ ttgcaggtag t9tgg3agst 

* 999 c 9 ctc t ^co.^ toec , ggcg , _ gieggag 2jiso 

9a93i£ "" 9 ' C * 9 ^* ™- cc,ccccc 9 c c t « 9a99CC 23240 

. 9 c ta9acat 9caaC99gag g . gg , gg . gg acgtgcgagt ^ 

cc 999 «« 9 t9aC9cc9cc gaggagccgc ^ 

" S ""'^ S """^ -~~ t9aa9 „„ t 99cgcggc . g „„, 

agcaccatcg tgcaggaggc cctcaaggag cgcgaqqaca tC( <™^ 

ay a cgcgaggagg tcccgctgac ggtggaggag 23480 

ctcagccggg cctacgaagc caacctctte t-™. 

caacctcttc tcgccgcggg tgccccccaa gaagcaggcc 23540 

aacggcacct gcgagcccaa cccccgcctc aacttctacc ccgtctttac o Q ^ 

c cc gcctttgc ggtgcccgag 23600 

scs^cc. « eatcac , t cttcttcaag aaccagcgca tccratetc ^ 

. ..cc.c.ccc ^i.^^ gcgccgccat aeatgagatc jj72o 
199 a99a . 9tccc c .. g . ccttc gaaggtctcg gca>ggacga 9aagc9c9eg 2j78o 
9 c aaac9Ctc t99a „, 9 „ cg , gagcgag ggccagaacg t3ctgg ^ ^ 

tt ccc«o. a ^c.^ tcggtcacgg at=agctgct catcaa9cgc ^ 
Scc^ccc tC9a9aac9a ctccgaggtg gatcccgagg acggaaaacc cgtOTtetcs ^ 
9 ac 9a99a9c tcgcgcg « g gct99gcacg ccga3t ^ ^ 

aa 9 « 9atga 9ctggtcilcc gccgagctcg agtgcctgca ^ 

9 oc 9 .ccccc > 9a c« t9 c 9 caa99tcgag gagccc „ gc actacgccct ccgccacggc ^ 
»c 9t9 c 9 oc a99 c« 9caa gtggag(;tC(i gcaaoctggt eccct>wg , ^ 

S3~ tc « 9 c « gagaaccg cctcggg „ g aacgtccra actgcaccct gaccggggag ^ 

act.c.tcc, Wllc t „ otctete tcacccteac ccggcagacc jj3so 
9cc at99999 tctg9cagoa gtgtctggag gaBeswc<! cccgcg>gcc 24440 
« 9 a 9 cc 9 c 9 . 9c9cc9C9a gctctggacg agcgcawgc ^ 
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™ t c atctteac ag „ t „ a3aacM cctgocegac j456o 

9t " 9 " ' 9 " 9C " 9 " ™ ~~«- t c« 9gagcg . eecggcatc „,„ 
«™ tg , g « 9=9c ccegccctcc s „ ttc9tco ^^^^ ^ 

CCCCCC^ t « 99a9CO , ctgct>cct9 K9cgtct9g cca>ctacct c9cccacMc 24740 

,CCS1M " e "" 9 "" *— -« i— ~- ocgctgdaac 24800 

ctct gcara cccc^ctc ^ctsc ,.e.« 9agc t9ctca9c9a gac9caagtg 24B6o 
«c 99t .c« « gagatcca gtgoteccM 

t ""~ e " ~- — "~c 9 , gga ctatcacgcc 2 „ 80 
„ ttctacga agac „ atc9 cgacc _ 2so<o 

. 9 4 9 C„ 9 « tctggc _ etgcaageca te=agciggc ccgccaag ^ 

«cc t c« ga eggggt « at ccggaccccc a9>ccggcga gg>acccaac 25im 

~™ „ tcc9ccgc ogctccgtgc aaoatgacge MM ^ 

atC9 cc, agc ggcgggccaa gaagcagcaa sagctccagg 2s2so 

c 9W c t999 ,c. 9= o agg=9gagga agt «„ gac gaggaggagg aca£ggagag 253<o 
« 999 .c, 9 c etagacgag9 aggaggaggc ^ 

99 .. 9 « g ece a g c ageg cc g c 9g cac=atc settee... „„„ . 

39 gaagcggctc ggagc<:ggc<: 25M0 

^ C ' 30a9 " 9 '= •—««. CCCCC^O C.CC.CC.,, ,SS 2 0 

""""" 90C " 9 ^" «*— ~ twtc*: « aaagcccc 255S0 

•»«U, ccccc^c ^ g999tactgt tcct9gcgcc cccacaag>g 256io 

C4. 9 «C 9 « g c« g « t « agC .« 9Ccg ggg „ acacc tccttc9cgc ggcgctactc 2s7oo 

9 «c CC cc.e p^, cggt9CC9C9 eaacgtccto taceattacc gtcaccteta 2576o 
=a 9 ccc«ac 9 . 9acagaag 9cccggc „ c cgcgcaa9ic ^ 
cc.cc.^c gacccgecag gactcggcc9 ^ ^ 

" acc " 9 " t9 ""<"* — — 9 « ggaaatg a „ gtaoaaa 25M0 

* OC9 "^ [ 9=9t " 9CtC « g . ccaacego 26000 

W,C9C[ C9a "* C9 ~ "™ «9«c g9tg t ctc ttaagg 26060 

•« 9 .aa 9 « agcgctttte cagagg „ ca ttacgtcata Mcatcacga ^ ^ 

tCCC " C9CCt t "" 9t "' « c^ccc, gcgcccccca 26l80 

""""" " 3tC9C " 9a <*~»~" —»t, »=»,«., 262>0 
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tg 9 9attc gt gccacccgca atcagatact gctggaacag gccgccctca cctccacccc 26300 

gcgacgtcag ctgaacccgc ccgcttggcc cgccgcccag gtgtaccagg aaaaccccgc 26360 

cccgaccaca gtcctcctgc cacgcgacgc ggaggccgaa gtccagatga ctaactccgg 26420 

ggcgcaatta gcgggcggcg cccgccacgt cgtcgctccc gggtacagag gtcggcccgc .26480 

accctacccc tccggcccta taaagaggct gatcattcga ggccgaggta tccagctcaa 26540 

cgacgaggtg gtgagctcct cgaccggtct tcggcccgac ggagtcttcc agcttggagg 26600 

cgccggccgc tcttccttca ccactcgcca ggcctacctg acgctccaga gctcttcctc 26660 

ccagcctcgc tccggcggca tcggcaccct ccagttcgtg gaggagttcg tgccctcggt 26720 

ccacttcaac ccgttctccg gctctcccgg ccgctacccg gacagcttca tccccaacta 26780 

cgacgcggtg agcgaatccg tggacggeta cgattgatga ccgatggtgc ggccgtaact 26840 

gcgcggcggc aacatctgca tcactgccat cgtcctcggt gcttcgcccg ggaggcctgt 26900 

gagttcatct acttccagct cgccccggac cagcttcagg gcccttcgca cggcgttaag 26960 

ctcgtgatag aggaagagct cgagagtagc tgcctgcgct gttttacctc gcgccccatc 27020 

ctagtcgaga gggaacgcgg taggaccacc ctcaccctct actgcatctg tgactccccg 27080 

gaattacacg aagacctgtg ttgccttcta tgtgccgaac aataacccct cttgtaacta 27140 

cctacatcca caataaacca gaatttggaa actcctttcg tttgtttgca g atg aaa 27197 



Met Lys 



til I?° t 9 ° f tC 9aC 9aC 9ac ttc aac ccc 9 tc fc ac <=cc tat gac 27242 
Arg Ala Arg Leu Asp Asp Asp Phe Asn Pro Val Tyr Pro Tyr Asp 

1430 1435 



1440 



act ccc aac get ccc tct gtt ccc ttc ate act cct ccc ttc gtc 
Thr Pro Asn Ala Pro Ser Val Pro Phe lie Thr Pro Pro Phe Val 
1445 1450 i45 5 



27287 



27332 



tec teg gac ggc ttg caa gaa aaa cca ccc gga atg etc aqt etc 
Ser Ser Asp Gly Leu Gin Glu Lys Pro Pro l!y Me! Leu Ser Leu 
1460 1465 1470 

Asn £t Gin o Ct *Z C aCC « a ^ 3g9gcatta act eta 

Asn Tyr Gin Asp Pro He Thr Thr Gin Asn Gly Ala Leu Thr Leu 
1475 1480 1485 

aag ctt ggc age gga ctg aac ata aac caa gat ggg gaa ctt acc 
Lys Leu Gly Ser Gly Leu Asn lie Asn Gin Asp G?y llu Leu Thr 
1490 1495 isoo 

III !? C a3 ° 9 ^ CtC 9tC aCt Ccc ccc att aca *aa gee aac 27467 
Ser Asp Ala Ser Val Leu Val Thr Pro Pro He Thr Lys Ala Asn 



27377 



27422 
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1505 



1510 



1515 



aac aca ata ggc eta gec ttc aat qca cat ntt «... 

a™ Thr n. Gly ^ M . Phe -J £ « «. c„ g 

1525 1530 

1540 1545 

aat agg ttg gga ata aca tac aac tct ccc ctr ^ 

j« my Ile Tht ^ £ - JJJ « «. ca. aac 

1555 1560 

sir E 2 E Leu S £ 52 ?* f" *<=<= 

1565 Pr ° Pr ° Leu As P Thr Ala Asn 

1570 1S75 

1585 159Q 

£ = IS Si S £ 2 s 5 S £ £ £ X = 

1600 1605 

S £ £ £ 5 £ s= k k E E £ 3J £ £ 

1615 1620 

1630 1635 

S pJe S S Leu E J? f 90 f * Ct "« «* 

1640 G1U ArS Pro Leu Ala L ^u Thr 

1645 1650 

1655 ?«, 116 Ser Gln Pro Leu Lys Val 

1660 1665 

E5 a !SS: = 2=SSS22 EE 

1675 1680 
aaa aac gga aat ctt act tta gqa acc caa - 

Ly. «, «» » mr _ !5 s s £ ;g .£ £ £ 

1690 1695 
1705 171Q 

3 E E IS £ K £ £ 31! £ £ - £ - £ 

1720 1725 



27512 



27557 



27602 



27647 



27692 



27737 



27782 



2.7827 



27872 



27917 



27962 



28007 



28052 



28097 
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ttt aat aac acc tea ctg gga gtg 
Phe Asn Asn Thr Ser Leu Gly Val 
1730 1735 



gca t'tc aac cca ccc att tct 
Ala Phe Asn Pro Pro He Ser 
1740 



tea gca aac 
Ser Ala Asn 
. 1745 



aac ggg ctg tct ctt gac att gga aat ggc ctt aea 
Asn Gly Leu Ser Leu Asp lie Gly Asn Gly Leu Thr 
1750 1755 



ctg caa tac aac agg etc gta gtg 
Leu Gin Tyr Asn Arg Leu Val Val 
1760 i7 65 



aac att ggc ggc ggg eta cag 
Asn He Gly Gly Gly Leu Gin 
1770 



ttt aac aac ggt get att acc get tec ata aat gca get eta cca 
Phe Asn Asn Gly Ala He Thr Ala Ser He Asn La Ma Leu Pro 
1775 17 80 1785 

r tt9 ^ 9 l at tC ° aat aaC Ca9 ctt tct ctt aat att gga ggc ggg 
Leu Gin Tyr Ser Asn Asn Gin Leu Ser Leu Asn He Gly Gly of? 
1790 1795 1800 

ctg cga tac aac ggc act tac aaa aat tta gec gtc aaa acc gac 
Leu Arg Tyr Asn Gly Thr Tyr Lys Asn Leu Ala Val Lys Thr Isp 
1805 1810 ibis 



tct ttt agg ggt ctt gaa att gac 
Ser Phe Arg Gly Leu Glu He Asp 
1820 1Q25 



agt aat cag ttc ctg gtg cca 
Ser Asn Gin Phe Leu Val Pro 
1830 



aga ctg ggt tct ggt eta aag ttt gat caa tat ggg tac att age 
Arg Leu Gly Ser Gly Leu Lys Phe Asp Gin Tyr Ily Tyr He Ser 
1835 1840 184 5 

n Ct CCa aCt 9tt aCg CCa aca aca ctt t99 act aea gca 
Val lie Pro Pro Thr Val Thr Pro Thr Thr Leu Trp Thr Thr Ala 

1850 1855 1860 



gac cct tct 
Asp Pro Ser 
1865 



ccc aac get act ttt tac gac age tta gat get aag 
Pro Asn Ala Thr Phe Tyr Asp Ser Leu Asp Ala Lys 
1870 1875 



gta tgg ctg gec tta gta aaa tgc aac ggc atg gtt aat gga acc 
val Trp Leu Ala Leu Val Lys Cys Asn Gly Met Val Asn l?y Thr 
1880 1885 1890 

Tit "!?? f a9 ?f tta aaa 99t act ct 9 ctc caa cct aeg get 

lie Ala lie Lys Ala Leu Lys Gly Thr Leu Leu Gin Pro Thr Ala 
1895 i 900 19Q5 

^ 9t tCt tW 9tt at9 tat tfct tac a 9 c aa t ggc acc aga 

Ser Phe He Ser Phe Val Met Tyr Phe Tyr Ser Asn Ily Thr Arg 
1910 i 9 i5 192Q 

tit J? l aC CCC a ° 9 ttL 9aa aat 9 aa SSC ata eta get agt 

Arg Thr Asn Tyr Pro Thr Phe Glu Asn Glu Gly He Leu Ala Ser 
I 925 1930 1935 . 



28142 



28187 



28232 



28277 



28322 



28367 



28412 



28457 



28502 



28547 



28592 



28637 



28682 



28727 
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sir S ?hr g" SJ £ 2 S K ™ 

1940 V ^ Ar9 ?i" Gly Asn Ser Asn Thr Asn 

1945 1950 

S 2 £ S E S 32 S E E £ E E E ~ 

1955 , r Thr ^9 T >r Pro 

1960 1965 

ss-«£eeeeeeeeee 

1975 1980 
ttc ttg cag g ga gac ccc act atq qcc ata tn* 

- - «j oly « P Pro th, £ E E £ £ E E E 

1990 1995 

E E E E 2 E E E E E E E T -* 

2000 7 Ser Leu Phe T hr Trp Arg 

2005 2010 

^EEEEEEEEEEEEEE 

2020 2025 

ryl Ue S K 51 tM aCaCtgtttt tctttte** gttttfttc 
2030 

tgctttttta caca gttC3a accgtcagac tccctccccc cetccacttc acccggtaca 
cctcccgctc cccctggatc gctgcgtaca ac t gcag ttt ggtgttcaga cacgggttct 
taggtgacag tatccacacg gcctctttgc cggccaggcg ctggtccgta atgctcacaa 
atccctccga cacgtcctcc agacacacgg tggaatccaa ggcgcccgtc tacaaaacaa 
acacagtcat gctccccacg ggttctctcc tcggtcgtac tgcgccagcg tgaacgggcg 
atggtgctcc atcagggctc gcagcaaccg ctgtcggcgc ggctcaccca ggctccggcg 
aaaagcgccc cgtctgggag tgctattcaa aaaacgcacc gcctttatca acagtctcct 
c 9 tgcggcgg gcgcagcagc gcacctggat ctctgtcagg tctttacaat aggtacagcc 
catcaccacc atgttgttta aaatcccaaa gctaaacacg ctccacccaa atgacatgaa 
ttccagcacc gccgcggcgt ggccatcata caatatgcgg aggtaaatca ggtgccgccc 
cctaatacaa acgctcccca tatacatcac ctccttaggc agttgataat taaccacctc 
ccggtaccag ggaaacctca cgtttactaa agccccaaac accaacattt taaaccagtc 
agccagcacc acccctcccg ccttacactg cagcgacccc ggctgtttac aatgacagtg 
aatcacccac ctctcatacc ccctaatgac ctggcgtggc tccacatcta tagtagcaca 
gcacacgcac accctcatgt aatgct.cat cacaaatctt tcccaagggg ttagtatcat 



28772 

28817 

28862 

28907 

28952 

28997 

29045 

29105 

29165 

29225 

29285 

29345 

29405 

29465 

29525 

29585 

29645 

29705 

29765 

29825 

29885 

29945 
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gt ccca gggt ac ggg ccac t cctgcagcac 3gtg aaa ggt acgcaggcgg gaacagtcct 30005 
cacc t c 3gac acataatgca tattcagatg ttcacactct aaaacccc gg ggct tccctc 30065 
caacgcagcc actggcaagt tctcagaggg tggtgtaagg c 9 g t gg tgct gatagggact 30125 
caatctgtgt cgacaccgtc tgtcgcgttg catcgtagac caacgcttgg cgcaccgcct 30185 
cgtacttcgc ccaaagaaaa c gg g t gcgac gccaacacac ttcc 3 cgtac c gtgggttcc 3 0245 
.cactcgagc tc g ctca gtt cCcaacgcat aa tgc a g cca ttcctgtaat ccacacaaca 30305 
g tc g ctc ggc ttccaaagag at gtg cacct cgtatcttat aacgtcccga tatatatcca 30365 
agca 33 ca 3t ca 3gg ccac t t 3 caacca 3 t g cac g ca ggc ggac tg atc g cgacacactg 30425 

gaggtggagg gagagacgga a 9 a gg cat gt tactccagac 99 t cg aaaa g c 3g atcaaa g 30485 

t 3 ca 3 atc 3 c gaagatggca 3 c 3 atcccc 3 cc 3 ctac 3 ct ggtgatagat caca 3 cca 3g 30545 

teaaacataa tgcgg ttttc caaa tg acct attaccgcct ccaccagagc c 9 ccac 3 c 9 c 30605 

acttccagaa acaccagcac ggctacggca ttctcctcaa aatcttcaaa cattaagctg 30665 

catgattgaa tcacccccaa ataattctcc t cct t cca tC ctcgcaaaat ttgagt aaaa 30725 

acctctcgca gattagctcc gtggcgt tca aaaa ggtC ac ttagagcgcc ctccaccgcc 30785 

atgcgcaagc acaccctcat gattgaaaaa tgccagtctc ctgaaccacc tgcagttgat 30845 

ttaaaagacc tatatta 33 a tcaattccac tctcccgcag ctccacgcgt agcattagct 30905 

gcaaaaagtc atttaaatct tcgcaaacta 3 c 3 c 33t aa 3 ctcgccgccg g gaa ttaggt 30965 

ctgaagcagt caccacacac ataa tttC ca 3tg aa ggagt cagtctaagc agcaaaaagc 3X025 

c 3 catga 3 ca gtgttgaaaa 9gaggggtca cgcaatgtaa catatgcagc caaaaatctc 31 085 

caaggtgtct gtgcataaac tccaccac tg aaaagtccaa atcatgtaaa tatgccatca 31145 

cc g cc t ca 9g aaccaccacg gacacaaaaa c 39g cc g ta g caaatacatg 3 t g tcct g ca 3X205 

aagcaaaaac acatttatac catagaggcg cgaattactt 333 gaaaaat cactcgctcc 3X265 

aaaactaaac aggccaccgt ctgaccgcgc cagccataaa aaaagcggtt cgaatgatta 31325 

aaaagaataa tagacacctc ccaccaggta ctcggctgca actc g t gcg o ccctatcaaa 31385 

acccc 9 c g9 a cgttcatgtc ggccatagaa aaaatgcggc ccaaatatcc caccggaatc 31445 

tceacggcca gctgcagtga tagoaaaaga acgccatgag gagcaatcac aaaattttca 3X505 

ggcgataaaa gcacataaag gttagaatag ccctgctgca caggtaataa agcccgcgag 31565 

ctcagcaaat gcacataaac cgcttcagcc atcccgtctt accgcgaaca aaaggctcac 31625 
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agtacacagt tactcaaccc acacgccaca cagtatttat acactcctca atcgccacgt 31685 

cacccgcccc gaacaaactc caaaagtcca aaaagtccaa aacgcccgcg taaaagcccg 3X745 

ccaaaacagc acttcctcat ttactctccc acagtacgtc acttccgccg cgcccgccgc 31 805 

cctcgccccg ccctcaccct cgcgctccac cccgcgcccc acgtcagact cccaccccgc 31865 

cccgcgcccg cgtcatccgc accccaccct cactccaccc ctaaccccgc ctcctcatta 31925 

tcatattggc accgttccca aataaggtat attatgatga tg 31967 

<210> 13 
<211> sos 
<212> PRT 

<213> Simian adenovirus 
<400> 13 

Met Arg Arg Ala Val Gly Val Pro Pro Val Met Ala Tyr Ala Glu Gly 
5 10 15 

Pro Pro Pro Ser Tyr Glu Thr Val Met Gly Ala Ala Asp Ser Pro Ala 
20 25 30 

Thr Leu Glu Ala Leu Tyr Val Pro Pro Arg Tyr- Leu Gly Pro Thr Glu 
5 40. 45 

Gly Arg Asn Ser He Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr 
0 55 60 

Thr Arg Val Tyr Leu Val Asp Asn Lys Ser Ala Asp lie Ala Ser Leu 
" 70 75 80 

Asn Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn 
65 90 95 

Asn Asp Phe Thr Pro Val Glu Ala Gly Thr Gin Thr lie Asn Phe Asp 
100 10S 110 

Glu Arg Ser Arg Trp Gly Gly Asp Leu Lys Thr lie Leu Arg Thr Asn 
115 120 125 

Met Pro Asn lie Asn Glu Phe Met Ser Thr Asn Lys Phe Arg Ala Arg 
±JV 135 140 

Leu Met Val Glu Lys Val Asn Lys Glu Thr Asn Ala Pro Arg Tyr Glu 



150 



145 



150 



Trp Phe Glu Phe Thr Leu Pro Glu Gly Asn Tyr Ser Glu Thr 



165 



170 



He Asp Leu Met Asn Asn Ala lie Val Asp Asn Tyr Leu 
180 ■ 



185 



Met Thr 
175 



Glu Val Gly 



190 



Arg Gin Asn Gly Val Leu Glu Ser Asp lie Gly Val Lys Phe Asp 



200 



Thr 



205 



Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Lys Leu Val 



215 



220 



Met Pro 



Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp He Val Leu Leu Pro 



230 



235 



240 



Gly Cys Gly Val Asp Phe Thr Gin Ser Arg Leu Ser Asn Leu Leu Gly 



250 255 
lie Arg Lys Arg Met Pro Phe Gin Ala Gly Phe Gin II 



260 



265 



e Met Tyr Glu 
270 



Asp Leu Glu Gly Gly Asn He Pro Ala Leu Leu Asp Val Ala Lys Tyr 
^ 280 285 

Glu Ala Ser He Gin Lys Ala Arg Glu Gin Gly Gin Glu lie Arg Gly 



300 



Asp Asn Phe Thr Val lie Pro Arg Asp Val Glu lie val Pro val Glu 

310 315 

Lys Asp Ser Lys Asp Arg Ser Tyr Asn Leu Leu Pro Gly Asp Gin Thr 
325 "0 335 

Asn Thr Ala Tyr Arg Ser Trp Phe Leu Ala Tyr Asn Tyr Gly Asp Pro 

345 350 

Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Thr Asp Val Thr 



360 



365 



Cys Gly Ser Gin Gin Vai Tyr Trp Ser Leu Pro Asp Met Met Gin Asp 

375 380 
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Pro Val Thr Phe Arg Pro Ser q P r n« v^i o 

385 9 „° Sr Ser Gln Val Se * Asn Tyr Pro Val Val 



395 



400 



Gly Val Glu Leu Leu Pro Val Hie Ala Lye Ser Phe Tyr Asn oiu Gin 

410 415 
Ala Val Tyr Ser Gin Leu He Arg Gin Ser Thr Ala Leu Thr Hie Val 



425 



430 

Phe Asn Arg Phe Pro Glu Asn Gin lie Leu Val Arg 



435 * xe uexi val Ar 9 Pro Pro Ala Pro 

440 445 

Thr lie Thr Thr Val ser Glu Asn Val Pro Ala Leu Thr Asp His Gly 

455 460 



Tel L6U Ar9 l? Q Ser Ile -* <*Y val Gin Arg Val Thr lie 



475 



480 



Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val His Lys Ala Leu Gly 



490 



He Val Ala Pro Lys Val Leu Ser Ser Arg Thr Phe 



495 



<210> 14 
<211> 917 
<212> PRT 

<213> Simian adenovirus 
<400> 14 

Met Ala Thr Pro Ser Met Met Pro Gin Trp Ser Tyr Met His lie Ala 

10 IS 

Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 

25 in ' . 



30 

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 

40 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 

55 60 
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Thr Leu Ar g Phe val Pro V al Asp Arg Glu Asp Thr ^ Tyr ^ ^ 

75 80 

I-ys Val Arg Phe Thr Leu A la Val Gly Asp A3n Arg Val Leu Agp ^ 

90 95 

Ala Ser Thr Tyr Phe Asp He Arg Gly Val Le « Asp Arg Giy Pro Ser 

105 110 

Phe Lys Pro Tyr Ser Gly Thr Ma Ty r A sn Ser Leu A la Pro Lys Gly 

120 125 

Ala Pro A3 n Pro Ser Glu Trp Lys Gly Ser Asp Asn Lys He Ser Val 

135 140 

Jg Oly Gin Ala Pro Phe Phe Ser Thr Ser xle Thr Lys Asp Gly Ile 

i50 155 

Gin Val A la Thr Asp Thr Ser Ser Gly Ala Val Tyr A la Lys Lys 

170 175 

•Tyr Cln Pro Pro oln val Gly Giu oin ^ ^ 



160 



Glu 



185 



190 



Ser Asp Ser Asp Lys Val Ala Gly Arg He L 



200 



Leu Lys Asp Thr Thr Pro 



205 



Met Phe Pro Cys Tyr Gly Ser Tyr Ala Lys Pro 



215 



Thr Asn Glu Gin Gly 



220 



Gly Gin Gly Thr ^n Thr Val Asp Leu Gin Phe Phe Ala Ser Ser Ser 



235 



240 



Ala Thr Ser Thr Pro Lys Ala Val Leu Tyr Ala Glu Asp Val Ala He 



250 



Glu Ala Pro Asp Thr His Leu Val Tyr Lys Pro Ala 



265 



255 



Val Thr Thr Thr 
270 



Thr Thr Ser Ser Gin Asp Leu Leu Thr Gin Gin Ala Ala Pro Asn Ar g 

280 285 

Pro Asn Tyr lie Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr 



153 



290 



295 



300 



Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu 
310 315 

Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin 



330 



335 



Leu Met Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp 



345 



350 



Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg lie He Glu 



360 



Asn 



365 



His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly 



380 



Ser Leu Val Thr Glu Thr Tyr Thr Gly Leu Ser Pro Gin Asn Gly Ser 



395 



400 



Asn Thr Trp Thr Thr Asp Ser Thr Thr Tyr Ala Thr Arg Gly Val Glu 

410 415 



He Gly ser Gly Asn Met Phe Ala Met Glu lie Asn Leu Ala Ala Asn 
420 425 430 

Leu Trp Arg Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp 

440 445 

Glu Tyr Lys Leu Thr Pro Asp Asn He Thr Leu Pro Asp Asn Lys Asn 
3U 455 460 

Thr Tyr Asp Tyr Met Asn Gly Arg Val Ala Ala Pro Ser Ser Leu Asp 
470 475 480 

Thr Tyr Val Asn lie Gly Ala Arg Trp Ser Pro Asp Pro Met Asp Asn 
485 490 495 

Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser 



SOS 



510 



Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His lie Gin Val 

I> JL 3 



520 



Pro 



525 
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Gin Lys Phe Phe Ala lie Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr 



540 



Thr Tyr Glu Trp Agn Phe Arg Lys Asp Val Asn Met lie 



550 



555 



Leu Gin Ser 
560 



Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Val Arg Phe Asp 



570 



575 



Ser He Asn Leu Tyr Ala Asn Phe 
580 



Phe Pro Met Ala His Asn Thr Ala 
565 



Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser 



600 



605 



Phe 



Asn Asp Tyr Leu Cys Ala Ala Asn Met Leu Tyr Pro lie Pro Ala 



615 



620 



Asn 



Ala Thr Ser Val Pro lie Ser lie Pro Ser Arg Asn Trp Ala Ala Phe 

630 "5 640 

Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu Thr PrO Ser Leu 
645 "0 655 

Gly Ser Gly Phe Asp Pro Tyr Phe Thr Tyr Ser Gly Ser He Pro Tyr 

665 670 



Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser lie 



680 



685 



Met Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr 



695 



700 



Pro Asn Glu Phe Glu lie Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn 



7 " 720 
Val Ala Gin Cys Asn Met Thr Lys Asp Trp Phe Leu He Gin 



725 



730 



Met Leu 
735 



Ser His Tyr Asn He Gly Tyr Gin Gly Phe Tyr Val Pro Glu Gly Tyr 



745 



750 



155 



Lys Asp Arg Met Tyr Ser Phe Phe Arg A 3 n Phe Gin Pro Met Ser Arg 

760 765 

Gin Val Val Asp Thr Thr Thr Tyr Thr Asp Tyr Lys Asn Val Thr Leu 

775 780 

Pro Phe Gin His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr 
790 7 95 800 



lie 
815 



Met Arg Glu Gly Gin Ala Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu 
805 810 815 

Gly Lys Thr Ala Val Pro Ser Leu Thr Gin Lys Lys Phe Leu Cys Asp 

Arg Thr Met Trp Arg lie Pro Phe Ser Ser Asn Phe Met Ser Met Gly 
UJb 840 845 

Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His 

855 860 

Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met Asp Glu Pro Thr Leu 



875 



880 



Leu Tyr Val Leu Phe Glu Val Phe Asp Val Val Arg lie His Gin 



885 



890 895 



His Arg Gly Val lie Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser 



900 



Gly Asn Ala Thr Thr 
915 



<210> 15 

<211> 607 

<212> PRT 

<213> Simian adenovirus 

<400> 15 



905 910 



Pro 



Ala 



Met Lys Arg Ala Arg Leu Asp Asp Asp Phe Asn Pro Val Tyr Pro Tyr 
Asp Thr Pro Asn Ala Pro Ser Val Pro Phe lie Thr Pro Pro Phe Val 



156 



20 



25 



30 



Ser Ser Asp Gly Leu Gin Glu Lys Pro Pro Gly Met Leu Ser Leu Asn 

40 4s 



Tyr Gin Asp Pro He Thr Thr Gin Asn Gly Ala 



55 



Leu Thr Leu Lys Leu 
60 



Gly ser Gly Leu Asn lie Asn Gin Asp Gly Glu Leu Thr Ser Asp Ala 

75 80 

ser Val Leu Val Thr Pro Pro He Thr Lys Ala Asn Asn Thr He Gly 

90 95 

^ Ma To Pr ° L6U Thr fS G1 " S « *-P ^r Leu Asn Leu 

105 110 

Ala Cys Asn Ala Pro Leu Thr Val Gin acn * 

115 Y" Gln Asp Asn Ar 9 L eu Gly He Thr 

120 125 

Tyr Asn ser Pro Leu Thr Leu Gin Asn Ser Glu Leu Ala Leu Ala Val 

135 .140 

Thr Pro Pro Leu Asp Thr Ala Asn Asn Thr Leu Ala Leu Lys Thr Ala 

155 160 

Arg Pro lie He Thr Asn Ser Asn Asn Glu Leu Thr Leu Ser Ala Asp 

170 175 

Ala Pro Leu Asn Thr Ser Thr Gly Thr Leu Ar g Leu Gin Ser Ala Ala 



185 



Pro Leu Gly Leu Val Asp Gin Thr Leu Arg Val Leu Phe 



200 



205 



190 



Ser Asn Pro 



Leu Tyr Leu Gin Asn Asn Phe Leu Ser Leu Ala lie 



215 



220 



Glu Arg Pro Leu 



Ala Leu Thr Thr Thr Gly Ser Met Ala Met Gin He Ser Gin Pro Leu 



235 



240 



Lys Val Glu Asp Gly Ser Leu Ser Leu Ser lie Glu 
245 250 



Ser Pro Leu Asn 
255 
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Leu Lys Aan Gly Asn Leu Thr Leu Gly Thr Gin Ser Pro Leu Thr Val 

2 60 " " — 



265 



270 



Thr Gly Asn Asn Leu Ser Leu Thr Thr Thr Ala Pro Leu Thr Val Gin 



280 285 
Asn Asn Ala Leu Ala Leu Ser Val Leu Leu Pro Leu Arg Leu Phe Asn 



295 



300 



Asn Thr Ser Leu Gly y«l Ala Phe Asn Pro Pro He Ser Ser Ala 



- -* Asn 

310 315 320 



Asn Gly Leu Ser Leu Asp lie Gly Asn Gly Leu Thr Leu Gin Tyr 
325 330 



Asn 



335 



Arg Leu Val Val Asn He Gly Gly Gly Leu Gin Phe Asn Asn Gly Ala 

345 350 



340 



He Thr Ala Ser lie Asn Ala Ala Leu Pro Leu Gin Tyr Ser Asn Asn 
355 360 3 65 

Gin Leu Ser Leu Asn lie Gly Gly Gly Leu Arg Tyr Asn Gly Thr Tyr 

375 38o 

Lys Asn Leu Ala Val Lys Thr Asp Ser Phe Arg Gly Leu Glu lie Asp 

Ser Asn Gin Phe Leu Val Pro Arg Leu Gly Ser Gly Leu Lys Phe Asp 

410 415 

Gin Tyr Gly Tyr lie Ser Val lie Pro Pro Thr Val Thr Pro Thr Thr 
420 «5 4 3o 

Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Ala Thr Phe Tyr Asp Ser 
435 440 445 

Leu Asp Ala Lys Val Trp Leu Ala Leu Val Lys Cys Asn Gly Met Val 



455 



460 



Asn Gly Thr lie Ala lie Lys Ala Leu Lys Gly Thr Leu Leu Gin Pro 
5 470 475 480 
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Thr Ala Ser Phe He Ser Phe Val Met Tyr Phe Tyr Ser Asn Gly Thr 



490 



495 



Arg Arg Thr Asn Tyr Pro Thr Phe Glu Asn Glu Gly He Leu Ala Ser 

5 °5 510 

Ser Ala Thr Trp Gly Tyr Arg Gin Gly Asn Ser Ala Asn Thr Asn Val 

520 525 

Thr Ser Ala Val Glu Phe Met Pro Ser Ser Thr Arg Tyr Pro Val Asn 

535 540 

Lys Gly Thr Glu Val Gin Asn Met Glu Leu Thr Tyr Thr Phe Leu Gin 

550 555 560 

Gly Asp Pro Thr Met Ala lie Ser Phe Gin Ala He Tyr Asn His Ala 
565 570 57S 

Leu Glu Gly Tyr Ser Leu Lys Phe Thr Trp Arg Val Arg Asn Arg Glu 

585 590 

Arg Phe Asp He Pro Cys Cys Ser Phe Ser Tyr He Thr Glu Glu 



600 



605 
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